DSpace
 

Tai Nguyen So - Vietnam National University, Ha Noi - VNU >
TRƯỜNG ĐẠI HỌC CÔNG NGHỆ >
PTN Micro Nano >
Articles of Universities of Vietnam from Scopus >

Search

Please use this identifier to cite or link to this item: http://tainguyenso.vnu.edu.vn/jspui/handle/123456789/12441

Title: Fuzzy named entity-based document clustering
Authors: Cao T.H.
Do H.T.
Hong D.T.
Quan T.T.
Keywords: 
Issue Date: 2008
Publisher: IEEE International Conference on Fuzzy Systems
Citation: Volume , Issue , Page 2028-2034
Abstract: Traditional keyword-based document clustering techniques have limitations due to simple treatment of words and hard separation of clusters. In this paper, we introduce named entities as objectives into fuzzy document clustering, which are the key elements defining document semantics and in many cases are of user concerns. First, the traditional keyword-based vector space model is adapted with vectors defined over spaces of entity names, types, name-type pairs, and identifiers, instead of keywords. Then, hierarchical fuzzy document clustering can be performed using a similarity measure of the vectors representing documents. For evaluating fuzzy clustering quality, we propose a fuzzy information variation measure to compare two fuzzy partitions. Experimental results are presented and discussed. © 2008 IEEE.
URI: http://tainguyenso.vnu.edu.vn/jspui/handle/123456789/12441
ISSN: 10987584
Appears in Collections:Articles of Universities of Vietnam from Scopus

Files in This Item:

File SizeFormat
HCM_U227.pdf49.52 kBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback