The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
It is our pleasure to introduce you to the Proceedings of the International Conference on Knowledge Engineering and Applications (ICKEA 2016) held in NTU, Singapore on September 28–30, 2016. Interest in knowledge engineering as well as its applications has remarkably increased in recent years. Novel solutions of complex knowledge technology, such as advanced knowledge technologies, knowledge relationship...
In this paper, we propose a novel text categorization method based on modified Support Vector Clustering (SVC). SVC is a density based clustering approach, which handles the arbitrary shape clusters effectively. The main drawback of traditional SVC is that it treats unclassified documents as outliers. To overcome this problem, we employed Fuzzy C-Means (FCM) to cluster unclassified documents. The...
A variety of data dependencies have been proposed for data cleaning, including conditional functional dependencies, editing rules and so on. Fixing rule is a newly proposed class of data dependencies for data repairing with high accuracy. However, to our knowledge, algorithms for automatically designing fixing rules have not been developed. In this paper, a workflow of generating fixing rules has...
Prior studies point out information uncertainty (IU) of a stock comes from investors' overconfidence. However, overconfidence is hard to be measured. After our collections and reviews of prior studies regarding IU, we propose stock returns are affected by investors' sentiment, and this relation is moderated by IU. We use SentimentAnalyzer system, a text mining technology, to analyze the posts in a...
The exact method JOCOR, proposed by Mueen et al., is the first method for joining two time series on subsequence correlation. Although JOCOR requires the time complexity O(n2lgn), where n is the length of the time series, it is still time-consuming even for medium-size time series. In this paper, we propose a hybrid method which can run faster than JOCOR. Our method consists of four main steps. First,...
Mining recurring patterns plays an important role in finding regularities for particular time intervals within a time series. Most existing methods on recurring pattern search typically focus on finding the patterns that exhibit temporal associations among items within a transaction (timestamp). However, present methods for mining recurring patterns are not expressive enough to capture recurring events...
In this paper, we first give a comprehensive summary of models and algorithms applied in three online job recommender systems and point out the advantages and disadvantages of these models. Then we introduce a job recommendation model based on Gradient Boosting Regression Tree and time factors (T-GBRT). The T-GBRT model aggregates the time factors into the GBRT to predict personal preferences and...
Managing Service Level Agreement (SLA) for composite semantic web services is a very complex process, as it involves many complex tasks such as components discovery, provisioning, monitoring, recovery, and coordination. Indeed, managing all these tasks manually is a very cumbersome operation, nevertheless it is time consuming and prone to errors. To overcome such problems, automated SLA management...
As the social impact of science and technology is increasing day by day, it is increasing the importance of science and technology policy. The Third Science and Technology Basic Plan, which is to be the basis of Korea Science and Technology Policy, will be established in 2013. There is need to investigate the value-oriented and policy priorities of the Science and Technology Basic Plan. The purpose...
Intelligent information applications (e.g. healthcare, business data mining, etc.) usually involve the processing of a huge amount of data. MapReduce can speed up the execution of the application (job) with big data by dividing the job into a number of concurrently running map and reduce tasks in cloud computing systems. With many MapReduce jobs in the systems, it is required to efficiently allocate...
This paper presents an application for counting the people who pass through the supervised area. Instead of traditional camera, this study used Kinect 2 to get the depth information of image. The processes of our approach includes preprocessing, candidate detection, tracking, identification and people counting. In the preprocessing stage, the foreground object was sliced by depth information to make...
Information system security in a company is an important element that every company should pay more attention due to the attacks against the security of the data that may not be inevitable. Probably every company knows how to protect their data even though this paper proposes something new which is more efficient. One of the ways that can be used to determine the security status of the company is...
With the increased use of today's information technologies, more people are using electronic calendaring systems, such as Outlook, to schedule everyday events. This includes work related items such as meetings as well as personal items such as birthdays and their kids' events. There have been no significant changes to this process. People enter their personal event items on their calendars. If someone...
With the rapid increase in the volume of biomedical publications, developing an efficient search strategy to retrieve relevant biomedical documents that match the user search intention is a tremendous challenge. This paper proposes a novel pseudo relevance feedback technique which combines MeSH terms and UMLS concepts to improve the performance of retrieving biomedical documents from MEDLINE. The...
Taobao is a network retailer which founded in May 2003 and now is the most popular online retail platform in China with nearly 500 million registered users. More than 60 million people visit Taobao everyday and over 48000 items are sold every minute on this platform. During the expansion progress, Taobao has transformed from a C2C network market into a worldwide E-commerce trading platform including...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.