مشخصات مقاله | |
ترجمه عنوان مقاله | تشخیص تقلب در صورت های مالی با استفاده از مدل های داده کاوی و GAN |
عنوان انگلیسی مقاله | Fraud detection in financial statements using data mining and GAN models |
نشریه | الزویر |
انتشار | مقاله سال 2023 |
تعداد صفحات مقاله انگلیسی | 38 صفحه |
هزینه | دانلود مقاله انگلیسی رایگان میباشد. |
نوع نگارش مقاله |
مقاله پژوهشی (Research Article) |
مقاله بیس | این مقاله بیس میباشد |
نمایه (index) | Scopus – Master Journals List – JCR |
نوع مقاله | ISI |
فرمت مقاله انگلیسی | |
ایمپکت فاکتور(IF) |
10.352 در سال 2022 |
شاخص H_index | 249 در سال 2023 |
شاخص SJR | 1.873 در سال 2022 |
شناسه ISSN | 0957-4174 |
شاخص Quartile (چارک) | Q1 در سال 2022 |
فرضیه | ندارد |
مدل مفهومی | دارد |
پرسشنامه | ندارد |
متغیر | دارد |
رفرنس | دارد |
رشته های مرتبط | حسابداری – مدیریت |
گرایش های مرتبط | حسابداری مالی – حسابداری عمومی – بانکداری – مدیریت مالی – حسابرسی |
نوع ارائه مقاله |
ژورنال |
مجله | سیستم های خبره با کاربردهای آن – Expert Systems with Applications |
دانشگاه | Faculty of Computer Engineering, K. N. Toosi University of Technology, Tehran, Iran |
کلمات کلیدی | تقلب در صورت های مالی – تشخیص ناهنجاری – تولید پرت – شبکه های متخاصم مولد – مدل های مجموعه – بخش بانکی |
کلمات کلیدی انگلیسی | Fraud in financial statements – anomaly detection – outlier generation – generative adversarial networks – ensemble models – banking sector |
شناسه دیجیتال – doi |
https://doi.org/10.1016/j.eswa.2023.120144 |
لینک سایت مرجع | https://www.sciencedirect.com/science/article/abs/pii/S0957417423006462 |
کد محصول | e17423 |
وضعیت ترجمه مقاله | ترجمه آماده این مقاله موجود نمیباشد. میتوانید از طریق دکمه پایین سفارش دهید. |
دانلود رایگان مقاله | دانلود رایگان مقاله انگلیسی |
سفارش ترجمه این مقاله | سفارش ترجمه این مقاله |
فهرست مطالب مقاله: |
Abstract 1 Introduction 2 Background and related work 3 Basic concepts and prerequisites 4 Proposed Method 5 Experimental study 6. Conclusion and Future Work References |
بخشی از متن مقاله: |
Abstract Financial statements are analytical reports published periodically by financial institutions explaining their performance from different perspectives. As these reports are the fundamental source for decision-making by many stakeholders, creditors, investors, and even auditors, some institutions may manipulate them to mislead people and commit fraud. Fraud detection in financial statements aims to discover anomalies caused by these distortions and discriminate fraud-prone reports from non-fraudulent ones. Although binary classification is one of the most popular data mining approaches in this area, it requires a standard labeled dataset, which is often unavailable in the real world due to the rarity of fraudulent samples. This paper proposes a novel approach based on the generative adversarial networks (GAN) and ensemble models that is able to not only resolve the lack of non-fraudulent samples but also handle the high-dimensionality of feature space. A new dataset is also constructed by collecting the annual financial statements of ten Iranian banks and then extracting three types of features suggested in this study. Experimental results on this dataset demonstrate that the proposed method performs well in generating synthetic fraud-prone samples. Moreover, it attains comparative performance with supervised models and better performance than unsupervised ones in accurately distinguishing fraud-prone samples.
Introduction Today, the incremental growth of fraud in business, especially in financial services, has become an earnest and costly problem. There exists no single definition for the concept of fraud in scientific sources. One of the clearest definitions available is the one provided by the Association of Certified Fraud Examiners (ACFE) in 2008. According to this definition, individuals and organizations may commit illegal actions such as deception or betrayal of trust for specific reasons, such as obtaining money, property, or individual or collective benefits, which are interpreted as fraud (Hashim et al., 2020, Sadgali et al., 2019, Syahria, 2019). The American Institute of Certified Public Accountants (AICPA) has also attributed the concept of fraud to any type of fraud, including minor employee theft, unproductive performance, embezzlement, misappropriation of assets, and fraudulent financial reporting (Hashim et al., 2020). As it can be understood from the above definitions, there are variants of fraud, among which this study is focused on fraud in financial statements. Financial statements are reports that detail an organization’s business activities and financial performance from various perspectives (Ashtiani and Raahemi, 2021, Jan, 2018). The most important contents of these reports include expenses, incomes, received or granted loans, profits, and losses (Ashtiani & Raahemi, 2021). These large amounts of numbers and figures provide an opportunity for profit seekers to cheat. Among the most common forms of fraud in financial statements are premature revenue recognition, spurious entries of incomes or profits, overstating assets, understating expenses, and concealment or false disclosure of expenses (Craja et al., 2020, Gray and Debreceny, 2014). According to the ranking provided by ACFE, financial statement fraud is the third most prevalent type of occupational fraud, after corruption and embezzlement (Hashim et al., 2020, Petković et al., 2021, Syahria, 2019). However, it has taken first place regarding the financial costs and the amount of loss it incurs (Omidi et al., 2019). Hence, early detection of this type of fraud can prevent its exorbitant financial consequences.
Conclusion and Future Work In this paper, a new approach has been proposed to detect fraud in bank financial statements. The basic idea is to adopt generative adversarial networks instead of over-sampling, under-sampling, or one-class classification techniques to make the approach applicable to real-world scenarios whose data is highly imbalanced with no or few fraudulent samples. The second idea is to tackle with high-dimensionality of the feature space by leveraging the ensemble of supervised and unsupervised models. In particular, the proposed approach utilizes a kind of generative adversarial model called MO-GAAL to fabricate a set of fraud-prone samples that have unconventional behavior on the one hand and are difficult to distinguish from fraud-free samples on the other hand. Further, samples are classified by an ensemble model, namely XGBOD, in which the outlier scores of each sample are first estimated by a collection of unsupervised models, and then these scores form a new feature vector to be classified by a supervised model named XGBoost. In summary, the ability to train an efficient decision-making model even in the absence of actual fraudulent samples is the main advantage of this work. |