Proceedings of
International Conference on Recent Trends in Computing and Communication Engineering RTCCE 2013
"AN EFFECTIVE STEMMER IN DEVANAGARI SCRIPT"
Abstract: “In today’s word of internet web search engines are developing the techniques to make the surfing faster. Stemming is a technique used by web search engines for prefix and suffix removal from the derived word. Stemming provides the way to store similar documents together. This research work aims at the development of Hindi stemmer based on Devanagari script for stripping both prefixes as well as suffixes from derived word to provide better stemming than previous stemmers. Proposed stemmer uses the hybrid approach which is the combination of lookup algorithm, suffix stripping algorithm and prefix removal algorithm.”
Keywords: natural language processing, stemming, overstemming, under-stemming, inflected word, Information Retrieval –IR, conflation