Linguistics; Threat identification; Geographic origins; Internet; Weblogs; Blogs
Issue Date:
2009
Publisher:
2009 International Conference on Asian Language Processing: Recent Advances in Asian Language Processing, IALP 2009
Citation:
Page : 190-194
Abstract:
This paper presents the first work on author profiling for Vietnamese blogs. This task is
important in threat identification and marketing intelligence. We have developed a Vietnamese Blog
Profiling framework to automatically predict age, gender, geographic origin and occupation of weblogs'
authors purely based on language use. Experiments on the blog corpus we collected show very
promising results, with accuracy of around 80% across all traits. © 2009 IEEE.