The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Relation extraction is a challenging task in biomedical text mining due to the complex of sentences in the biomedical literature. In this paper, we address multi-class relationship extraction problem from biomedical literature using Maximum Entropy model with simple word features. The proposed method is applied to extract the protein-protein interactions. Experiments show the method achieves an accuracy...
Classification based on predictive association rules (CPAR) is a kind of association classification methods which combines the advantages of both associative classification and traditional rule-based classification. For rule generation, CPAR is more efficient than traditional rule-based classification because much repeated calculation is avoided and multiple literals can be selected to generate multiple...
Recently, the Web has been the data repository. In order to obtain the relevant information from the repository, many research have been made. The typical function of Web news extraction is to locate the useful content text and filter the noises , both main issues result in Web news extraction that is an open research problem. In this paper , we describe an approach that can cluster the pages which...
Unknown word recognition is a very important problem in natural language processing. It has a great influence on the performance of dictionary construction and word segmentation. This paper introduces two methods to improve the effect of Chinese unknown word recognition by using Conditional Random Fields: the rough label of the characters and the N-best listing. The CRF with the two methods proposed...
A sentence-based Chinese text input method system is proposed in this paper, which is implemented on both Symbian S60 and Windows Mobile platform with such characters as easy-to-use, efficient and smart. The whole system is compacted within 150 k, and can be integrated with cell phone, PDA and remoter.
Most of the research in last few decades has focused on automatic natural language processing (NLP) in English, European and East Asian languages. But unfortunately South Asian languages especially Urdu have received less attention. In this paper we present a survey regarding classification of Urdu language. The main goal of this survey is to present briefly about the material available on Urdu NLP,...
This paper introduces the features of Symbian OS and system composing, especially with the development experience on the sentence-level (mobile telephone) intelligent pinyin Chinese input method, discusses the implementation of FEP (front end processors) based on Symbian OS. Through the test of and T9 input methods, we found that input method is greatly fit for the demand of large text processing...
Semantic search requires a search engine to properly interpret the meaning of a user's query and the inherent relations among the terms that a document contains with respect to a specific domain. We present the framework of such a search engine based on domain ontologies. In this framework, a search request, which can be either a keyword list as in traditional search methods or a query in complex...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.