10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

chapter

The development of cross-language plagiarism detection tool utilising fuzzy swarm-based summarisation

S Alzahrani, N Salim, C K Kent, M S Binwahlan, et al.

2010 10th International Conference on Intelligent Systems Design and Applications > 86 - 90

This work presents the design and development of a web-based system that supports cross-language similarity analysis and plagiarism detection. A suspicious document d_q in a language L_q is submitted to the system via a PHP web-based interface. The system accepts the text either by upload or by pasting it directly into a text area. In order to lighten large texts and provide an ideal set...

chapter

Mining pharmaceutical spam from Twitter

Chandra Shekar, Shruti Wakade, Kathy J Liszka, Chien-Chung Chan

2010 10th International Conference on Intelligent Systems Design and Applications > 813 - 817

This paper presents a method that applies text mining techniques and data mining tools to detect pharmaceutical spam in Twitter data. A simple method based on a manually selected list of 65 discriminating pharmaceutical words is used to label spam training tweets. Preliminary experimental results show that the J48 decision tree classifier performs better than the Naïve Bayesian algorithm.

chapter

Automatic extraction and classification approach of opinions in texts

Rihab Bouchlaghem, Aymen Elkhlifi, Rim Faiz

2010 10th International Conference on Intelligent Systems Design and Applications > 918 - 922

In this paper, we present an approach to automatically extract and classify opinions in texts. We propose a similarity measure that calculates semantic distances between a word and predefined subgroups of seed words. We evaluated our algorithm on the corpus of the semantic evaluation campaign “SemEval 2007” and obtained best Precision and F1 values of 62% and 61%, respectively. As an improvement of 20 %...

chapter

Omni font OCR error correction with effect on retrieval

W Magdy, K Darwish

2010 10th International Conference on Intelligent Systems Design and Applications > 415 - 420

Recent library digitization projects attempt to provide large collections of printed material from varying sources in a searchable format. The scanned documents are typically processed using Optical Character Recognition (OCR), which introduces errors in the text. This paper proposes a technique for correcting OCR-degraded text that is independent of character-level OCR errors, and hence...

chapter

Recommendation by composition style

S Karmakar, Ying Zhu

2010 10th International Conference on Intelligent Systems Design and Applications > 818 - 822

Composition style is often an important factor in readers' selection of reading materials. For example, a reader may seek out articles written in a style similar to that of his or her favorite writer. We present a new method for providing recommendations based on composition style. Our algorithm analyzes and encodes the readability index and syntactical structure of a model document, and then searches for...

chapter

Towards developing an Arabic word alignment annotation tool with some Arabic alignment guidelines

Hisham A Kholidy, N Chatterjee

2010 10th International Conference on Intelligent Systems Design and Applications > 778 - 783

Word alignment is an important supporting task for various NLP applications such as training of machine translation systems, translation lexicon induction, word sense discovery, word sense disambiguation, information extraction and the cross-lingual projection of linguistic information. In this paper we study the main rules and guidelines required to build an aligner tool for the Arabic language which...

chapter

Determination of Bloom's cognitive level of question items using artificial neural network

Norazah Yusof, Chai Jing Hui

2010 10th International Conference on Intelligent Systems Design and Applications > 866 - 870

We propose a classification model for the cognitive level of question items in examinations based on Bloom's taxonomy. The model implements the artificial neural network approach, which is trained using the scaled conjugate gradient learning algorithm. Several data preprocessing techniques such as word extraction, stop word removal, stemming, and vector representation are applied to a feature set...

chapter

Wiki-rec: A semantic-based recommendation system using Wikipedia as an ontology

Ahmed Elgohary, Hussein Nomir, Ibrahim Sabek, Mohamed Samir, et al.

2010 10th International Conference on Intelligent Systems Design and Applications > 1465 - 1470

Nowadays, satisfying user needs has become the main challenge in a variety of web applications. Recommender systems play a major role in that direction. However, as most of the information is present in a textual form, recommender systems face the challenge of efficiently analyzing huge amounts of text. The usage of semantic-based analysis has gained much interest in recent years. The emergence of...

chapter

Improving Arabic document categorization: Introducing local stem

Eiman Tamah Al-Shammari

2010 10th International Conference on Intelligent Systems Design and Applications > 385 - 390

Stemming is a fundamental step in processing textual data, preceding the tasks of text mining, Information Retrieval (IR), and natural language processing (NLP). The common goal of stemming is to standardize words by reducing each word to its base (root or stem), so it can also be considered a feature reduction technique. This paper presents a new dictionary-free, content-based Arabic stemmer...
