Mining Informal Language from Chinese Microtext: Joint Word Recognition and Segmentation
Research Area: Natural Language Processing
Year: 2013
Type of Publication: In Proceedings
  • Aobo Wang
  • Min-Yen Kan
We address the problem of informal word recognition in Chinese microblogs. A key difficulty is that Chinese text lacks word delimiters, so recognizing informal words relies on segmenting the text correctly. We exploit this reliance as an opportunity: recognizing the relation between informal word recognition and Chinese word segmentation, we propose to model the two tasks jointly. Our joint inference method significantly outperforms baseline systems that conduct the tasks individually or sequentially.