مشخصات مقاله | |
انتشار | مقاله سال 2018 |
تعداد صفحات مقاله انگلیسی | 9 صفحه |
هزینه | دانلود مقاله انگلیسی رایگان میباشد. |
منتشر شده در | نشریه الزویر |
نوع مقاله | ISI |
نوع نگارش مقاله | مقاله کوتاه (Short Communication) |
مقاله بیس | این مقاله بیس نمیباشد |
عنوان انگلیسی مقاله | Review on mining data from multiple data sources |
ترجمه عنوان مقاله | مروری بر داده کاوی از منابع اطلاعاتی متعدد |
نمایه (index) |
Scopus – Master Journals – JCR
|
ایمپکت فاکتور(IF) |
2.694 در سال 2018
|
شاخص H_index |
129 در سال 2019
|
شاخص SJR |
0.662 در سال 2017
|
شناسه ISSN |
0167-8655
|
شاخص Quartile (چارک) |
Q1 در سال 2017
|
فرمت مقاله انگلیسی | |
رشته های مرتبط | مهندسی صنایع |
گرایش های مرتبط | داده کاوی |
نوع ارائه مقاله |
ژورنال |
مجله | اسناد تشخیص الگو – Pattern Recognition Letters |
دانشگاه | Institute of Natural and Mathematical Sciences – Massey University – New Zealand |
کلمات کلیدی | داده کاوی منابع چندگانه، تجزیه و تحلیل الگو، طبقه بندی داده ها، خوشه بندی داده ها، تلفیق داده |
کلمات کلیدی انگلیسی |
Multiple data source mining, Pattern analysis, Data classification, Data clustering, Data fusion
|
کد محصول | E5871 |
وضعیت ترجمه مقاله | ترجمه آماده این مقاله موجود نمیباشد. میتوانید از طریق دکمه پایین سفارش دهید. |
دانلود رایگان مقاله | دانلود رایگان مقاله انگلیسی |
سفارش ترجمه این مقاله | سفارش ترجمه این مقاله |
بخشی از متن مقاله: |
1. Introduction
The advancement of information communication technology has generated a large amount of data from different sources, which may be stored in different geological locations. Each database may have its own structure to store data. Mining multiple data sources [1–3] distributed at different geological locations to discover useful patterns are critical important for decision making. In particular, the Internet can be seen as a large, distributed data repository consisting of a variety of data sources and formats, which can provide abundant information and knowledge. Data from different sources may seem irrelevant to each other. Once information generated from different sources is integrated, new and useful knowledge may emerge. Here is an excellent example of how an organization to utilize mining data from different data sources to obtain profound information, which cannot obtain from an individual source. The Australian Taxation Office (ATO) mines data from different data sources such as social media posts, private school records and immigration data to detect tax cheats. Mining data from different data sources become a sophisticated tool to crackdown tax cheats that yielded nearly $10 billion in 2016 [4]. For example, in a normal Australian family, the husband has a business and reported $80,000 of taxable income per year, putting him just inside the second-lowest tax bracket, and his wife reported earning $60,000 per year. But the data collected from different data sources revealed that the family had three children at private schools at an estimated cost of $75,000 per year, while immigration records and social media posts showed that the family had recently taken five business-class flights and a holiday in a Canadian ski resort, Whistler. It means their declared incomes did not match their lifestyle. This prompted ATO to contact them to confirm if they have unpaid taxes. From the above example, we can see that developing an effective data mining technique for mining from multiple data sources to discover useful information is crucially important for decision making. However, how to efficiently mine quality information from multiple data sources is a challenging task for current research [5–9], especially in the current big data era, because in real world applications, data stored in multiple places often have conflictions [10]. The conflictions include: (i) data name conflictions: (a) the same object has different names in different data sources, or (b) two different objects from different data sources may have the same name; (ii) data format conflictions: the same object in different data sources has different data types, domains, scales, and preci sions; (iii) data value confliction: the same object in different data sources records different values; (iv) data sources confliction: different data sources have different database structures |