Data quality plays an important role in modern intelligent information systems and is crucial to any data analysis task. Many imperfection-handling techniques avoid overfitting or simply remove offending portions of the data. Data correction can help to retain and recover as much information as possible from the original data resources. In this paper, we propose a novel technique based on polynomial...
The graphics processing unit (GPU) has evolved into a key part of today's heterogeneous parallel computing architecture. A number of influential data mining algorithms have been parallelized on GPUs including frequent pattern mining algorithms, such as Apriori. Unfortunately, due to two major challenges, the more effective method for mining frequent patterns without candidate generation named FP-Growth...
Matchmaking is an everlasting topic in human society. In this paper, we focus on human attractiveness in matchmaking for the new generation of individuals in China based on If You Are The One, a famous dating game show in China. We collect a comprehensive contestant dataset, including numeric, text and image attributes. We investigate the influence of various factors on human attractiveness using...
The issue of incomplete data exists across the entire field of data mining. In this paper, a novel two-phase method is developed to deal with the challenge of incomplete data on classification problems. In phase I, the dataset is divided into disjoint subsets based on the attributes with missing values. In phase II, each subset is used to train appropriate classification algorithms respectively in...
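As a rough illustration of phase I only, the following minimal sketch groups records by which attributes are missing, so that the resulting subsets are disjoint. The abstract is truncated, so this assumes that "based on the attributes with missing values" means grouping by missing-value pattern; the function name `partition_by_missing_pattern`, the use of `None` as the missing marker, and the sample records are all hypothetical and not taken from the paper. Phase II (training a classifier per subset) is not shown.

```python
from collections import defaultdict


def partition_by_missing_pattern(rows):
    """Group rows by the set of attribute names whose value is None.

    Assumption: a missing value is represented as None. Every row falls
    into exactly one group, so the groups are disjoint by construction.
    """
    subsets = defaultdict(list)
    for row in rows:
        pattern = frozenset(name for name, value in row.items() if value is None)
        subsets[pattern].append(row)
    return dict(subsets)


# Hypothetical toy records, for illustration only.
rows = [
    {"age": 34, "income": 52000, "label": 1},
    {"age": None, "income": 48000, "label": 0},
    {"age": 29, "income": None, "label": 1},
    {"age": None, "income": 61000, "label": 1},
]

subsets = partition_by_missing_pattern(rows)
for pattern, members in subsets.items():
    print(sorted(pattern), len(members))
```

Each subset shares the same set of observed attributes, so a classifier trained on it would not need imputation for that subset.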
Temporal information is an important characteristic of an event. It can be used in the information retrieval process to organize the returned results. In Chinese, time expressions take very complex forms, which makes it difficult both to recognize a time expression accurately and to connect it precisely with a given event in a Web page that contains multiple events. To address these problems,...
Alzheimer's disease (AD) is one of the most common forms of dementia and has become a serious issue among the elderly in aging societies. Since AD is incurable and degenerative, early diagnosis is essential, as it gives patients and their families more opportunities to arrange their lives. In the meantime, histopathologic studies have found that MCI (mild cognitive impairment) subjects usually...
Rare class problems exist extensively in real-world applications across a wide range of domains. The extreme scarcity of the target class challenges traditional machine learning algorithms focusing on the overall classification accuracy. As a result, purposefully designed techniques are required for effectively solving the rare class mining problem. This paper presents a systematic review of the major...
High-dimensional and high-resolution gene expression data generated by the in situ hybridization (ISH) technique provide biologists with a powerful tool to study gene functions. Nevertheless, a major challenge in analyzing such data is how to efficiently retrieve genes showing certain spatial expression patterns and/or genes showing expression patterns similar to those of the query gene. The development of a fast...