Publications

Ph.D. Thesis

  1. Yee Fan Tan (2011). Cost-Sensitive Web-Based Information Acquisition for Record Matching. Ph.D. thesis, School of Computing, National University of Singapore, December 2011. [PDF]

Journal, Conference, and Workshop Publications

  1. Jonathan Yan Horn Poon, Kazunari Sugiyama, Yee Fan Tan, and Min-Yen Kan (2012). Instructor-Centric Source Code Plagiarism Detection and Plagiarism Corpus. In Proceedings of the 17th ACM SIGCSE Conference on Innovation and Technology in Computer Science Education (ITiCSE), pages 122-127, Haifa, Israel, July 2012. [PDF]
  2. Yee Fan Tan and Min-Yen Kan (2010). Hierarchical Cost-sensitive Web Resource Acquisition for Record Matching. In Proceedings of the 9th IEEE/WIC/ACM International Conference on Web Intelligence (WI), pages 382-389, Toronto, Canada, August-September 2010. [PDF]
  3. Thuy Dung Nguyen, Min-Yen Kan, Dinh-Trung Dang, Markus Hänse, Ching Hoi Andy Hong, Minh-Thang Luong, Jesse Prabawa Gozali, Kazunari Sugiyama, and Yee Fan Tan (2010). ForeCite: towards a reader-centric scholarly digital library. In Proceedings of the 10th ACM/IEEE Joint Conference on Digital Libraries (JCDL), pages 387-388, Gold Coast, Queensland, Australia, June 2010. [PDF]
  4. Dinh-Trung Dang, Yee Fan Tan, and Min-Yen Kan (2008). Towards a Webpage-based Bibliographic Manager. In Proceedings of the 11th International Conference on Asian Digital Libraries (ICADL), pages 313-316, Bali, Indonesia, December 2008. [PDF]
  5. Yee Fan Tan, Ergin Elmacioglu, Min-Yen Kan, and Dongwon Lee (2008). Efficient Web-Based Linkage of Short to Long Forms. In Proceedings of the 11th International Workshop on the Web and Databases (WebDB), Vancouver, Canada, June 2008. [PDF]
  6. Steven Bird, Robert Dale, Bonnie Dorr, Bryan Gibson, Mark Joseph, Min-Yen Kan, Dongwon Lee, Brett Powley, Dragomir Radev, and Yee Fan Tan (2008). The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC), pages 1755-1759, Marrakech, Morocco, May 2008. [PDF] [Corpus]
  7. Min-Yen Kan and Yee Fan Tan (2008). Record Matching in Digital Library Metadata. In Communications of the ACM (CACM), Volume 51, Issue 2, pages 91-94, February 2008. [PDF]
  8. Ergin Elmacioglu, Yee Fan Tan, Su Yan, Min-Yen Kan, and Dongwon Lee (2007). PSNUS: Web People Name Disambiguation by Simple Clustering with Rich Features. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval), pages 268-271, Prague, Czech Republic, June 2007. [PDF]
  9. Yee Fan Tan, Min-Yen Kan, and Dongwon Lee (2006). Search Engine Driven Author Disambiguation. In Proceedings of the 6th ACM/IEEE Joint Conference on Digital Libraries (JCDL), pages 314-315, Chapel Hill, North Carolina, USA, June 2006. [PDF]
  10. Yee Fan Tan, Min-Yen Kan, and Hang Cui (2006). Extending corpus-based identification of light verb constructions using a supervised learning framework. In Proceedings of the EACL 2006 Workshop on Multi-word-expressions in a multilingual context (MWEmc), pages 49-56, Trento, Italy, April 2006. [PDF] [Corpus]
  11. Renxu Sun, Jing Jiang, Yee Fan Tan, Hang Cui, Tat-Seng Chua, and Min-Yen Kan (2005). Using Syntactic and Semantic Relation Analysis in Question Answering. In Proceedings of the 14th Text Retrieval Conference (TREC), Gaithersburg, Maryland, USA, November 2005. [PDF]

Technical Reports

  1. Yee Fan Tan and Min-Yen Kan (2010). A Framework for Hierarchical Cost-sensitive Web Resource Acquisition. Technical Report TRA3/10, School of Computing, National University of Singapore, March 2010. [PDF]
  2. Yee Fan Tan and Min-Yen Kan (2010). Cost-sensitive Attribute Value Acquisition for Support Vector Machines. Technical Report TRB3/10, School of Computing, National University of Singapore, March 2010. [PDF]