DOCUMENT CLASSIFICATION USING DISTRIBUTED MACHINE LEARNING
Published In: 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION PROCESSING AND COMMUNICATION TECHNOLOGY
Author(s): GALIP AYDIN , IBRAHIM RIZA HALLAC
Abstract: In this paper, we investigate several machine learning algorithms for automatic classification of Turkish news into predetermined categories like economy, life, health etc. We use Apache Big Data technologies such as Hadoop, HDFS, Spark and Mahout, and distributed machine learning frameworks.
- Publication Date: 19-Apr-2015
- DOI: 10.15224/978-1-63248-044-6-129
- Views: 0
- Downloads: 0
USE OF OUTLIER DETECTION IN DATABASE AVAILABILITY ANALYSIS
Published In: 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION PROCESSING AND COMMUNICATION TECHNOLOGY
Author(s): MARIO ZGELA
Abstract: Usually, database is available and accessible by authorized users. However, there are abnormal situations resulting in database unavailability. Since irregularity may be related to the notion of outliers, in this paper it is checked if outlier detection may be helpful in finding out abnormality in database usage and so improving the process of database availability analysis.
- Publication Date: 19-Apr-2015
- DOI: 10.15224/978-1-63248-044-6-133
- Views: 0
- Downloads: 0