Mining Informal Language from Chinese Microtext: Joint Word Recognition and Segmentation
Research Area: Natural Language Processing
Year: 2013
Type of Publication: In Proceedings  
  • Aobo Wang
  • Min-Yen Kan
We address the problem of informal word recognition in Chinese microblogs. A key problem is the lack of word delimiters in Chinese. We exploit this reliance as an opportunity: recognizing the relation between informal word recognition and Chinese word segmentation, we propose to model the two tasks jointly. Our joint inference method significantly outperforms baseline systems that conduct the tasks individually or sequentially.