ARABIC TEXT CATEGORIZATION USING ROCCHIO MODEL
Published In: INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, ELECTRONICS AND COMMUNICATION
Author(s): ABDUELBASET GOWEDER , KILIAN STOFFEL , MOHUMMED ELBOASHI
Abstract: Automatic text categorization is considered an important application in natural language processing. It is the process of assigning a document to predefined categories based on its content. In this research, some well-known techniques developed for classifying English text are considered to be applied on Arabic. This work focuses on applying the well-known Rocchio (Centroid-based) technique on Arabic documents. This technique uses centroids to define good class boundaries. The centroid of a class c is computed as center of mass of its members. Arabic language is highly inflectional and derivational which makes text processing a complex task. In the proposed work, first Arabic text is preprocessed using tokenization and stemming techniques. Then, the Rocchio Algorithm is adopted and adapted to be applied to classify Arabic documents. The implemented algorithm is evaluated using a corpus containing a set of actual documents. The results show that the adapted Rocchio algorithm is applicab
- Publication Date: 13-Oct-2013
- DOI: 10.15224/978-981-07-7965-8-15
- Views: 0
- Downloads: 0
NOVEL FIRST RESPONDER SCRIPT AS A TOOL FOR COMPUTER FORENSICS
Published In: INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, ELECTRONICS AND COMMUNICATION
Author(s): ALEKSANDAR RISTESKI , MARJAN STOILKOVKSI , MITKO BOGDANOSKI
Abstract: The computer forensics as a branch of digital forensic pertaining to legal evidence found in computers and digital storage media. In order forensic acquisition to be more reliable it must be performed on computers that have been powered off. This type of forensics is known as ‘traditional’ or \'dead\' forensic acquisition. However, this type of forensic cannot be used to collect and analyze the information which is not on the hard disk, or encrypted data. The disadvantages of the dead forensics can be overcome handling a live forensics acquisition process. There are many commercial and freeware tools which can be used to provide information based on live forensics acquisition. The problem with this tools is that in many cases the examiner cannot explain the script functionality and generated results and information. Because of this reason there is a increased need for developing and using script which can be easy explained and adapted to any analysis which should be made by the examine
- Publication Date: 13-Oct-2013
- DOI: 10.15224/978-981-07-7965-8-16
- Views: 0
- Downloads: 0