Page 180 - Kỷ yếu hội thảo khoa học lần thứ 12 - Công nghệ thông tin và Ứng dụng trong các lĩnh vực (CITA 2023)
P. 180

164








                     Equation  (7)  can  be  used  as  a  scoring  function  to  measure  the  quality  of  a  tree
                     structure q. This score is the same as the classification score for evaluating decision
                     trees, except that it is computed for a wider range of objective functions.
                       In  fact,  the  proven  XGBoost  algorithm  optimizes  speed  and  performance  for
                     building predictive models. At the same time, the XGBoost algorithm uses a variety
                     of data formats, including tabular data of different sizes and layered data types. The
                     algorithm of XGBoost is now widely by data scientists, is a scalable machine learning
                     system for tree boosting which can help to avoid over-fitting. It performs well on its
                     own and has been shown to be successful in many machine learning problems.
                       In our model, the XGBoost algorithm was implemented as the tree model is usually
                     used as a primary classifier in XGBoost System. The features extracted from CNN
                     were fed to train and test the XGBoost classifier in this study.



                     4      Results


                     4.1    Dataset

                     This study uses the dataset name as  WEAPD [10] that contained 6,862 images. The
                     authors have collected weather images from various sources. Figure 3 below depicts a
                     few examples of images from this dataset.



































                                            Fig 3. Example of weather images dataset



                     CITA 2023                                                   ISBN: 978-604-80-8083-9
   175   176   177   178   179   180   181   182   183   184   185