4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL 2019)

at the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Paris, France

Workshop sponsored by SRI International and Chan-Zuckerberg Initiative

Last Updated: Wed May 15 23:35:41 PDT 2019
  • Prof. Bonnie Webber, U. Edinburgh, will deliver the Adobe Distinguished Keynote!
  • Alex Wade from Chan-Zuckerberg Initiative (CZI) will deliver a keynote!
  • Select BIRNDL papers will be invited to a special issue on Mining Knowledge from Scientific Data by Tanmoy, Sumit and Cornelia
  • BIRNDL 2019 workshop will be hosted at ACM SIGIR 2019, Paris! The first call for papers for all tracks, including the CL-SciSumm Shared Task, will be out soon! Thanks to the PCs for their continued support!
  • BIRNDL 2019 workshop proposed at ACM SIGIR 2019!


Call for Papers

You are invited to participate in the 4th Joint Workshop on Bibliometric-enhanced IR and NLP for Digital Libraries (BIRNDL).

This is the 4th BIRNDL workshop, following a series of successful BIRNDL and BIR workshops at other premier NLP/IR/DL venues. In conjunction with the BIRNDL workshop, we will also hold the 5th CL-SciSumm Shared Task in Scientific Document Summarization.

Reports from the shared task systems will be featured as part of a session at the workshop.

Aim of the Workshop

The goal of the BIRNDL workshop at SIGIR 2019 is to engage the IR community in the open problems in Big Science. Big Science refers to the large, cross-domain digital repositories which index research papers, such as the ACL Anthology, arXiv, ACM Digital Library, PubMed, IEEE database, Web of Science and Google Scholar. Currently, digital libraries collect and allow access to digital papers and their metadata (inclusive of citations) but mostly do not analyze the items they index. The scale of growth in scholarly publishing poses a challenge for scholars in their search for relevant literature. Finding relevant scholarly literature is the key focus of the workshop and sets the agenda for methods and approaches to be discussed and evaluated at BIRNDL.

We invite papers and presentations that incorporate insights from IR, bibliometrics and NLP to develop new techniques to address the open problems in Big Science, such as evidence-based searching, measurement of research quality, relevance and impact, the emergence and decline of research problems, identification of scholarly relationships and influences and applied problems such as language translation, question-answering and summarization.

For your reference, please see the proceedings of the 3rd BIRNDL workshop and a recent report in SIGIR Forum: http://sigir.org/wp-content/uploads/2019/01/p105.pdf.

Workshop Topics

By design, BIRNDL is an inclusive and diverse venue, in terms of both constituency and research. To promote a diverse constituency, we explicitly encourage female first authors. We invite stimulating research on topics including, but not limited to, full-text analysis, including multilingual analysis, IR methods for DL, and applications of citation-based NLP. Specific examples of fields of interest include:

Importantly, to address the scarcity of validated datasets in this area, we also invite papers describing new and pre-existing datasets. Submissions in this track will include instructions for accessing the data; metadata and documentation on its organization, content, and quality; and descriptions of possible use cases. We also invite descriptions of running projects and ongoing work as well as contributions from industry. Papers that investigate multiple themes directly are especially welcome.


Special Issue

Select BIRNDL papers will be invited to a special issue on Mining Knowledge from Scientific Data by Tanmoy, Sumit and Cornelia.



The CL-SciSumm Shared Task


The CLSciSumm19 corpus is expected to be of interest to a broad community including those working in natural language processing, machine learning, computational linguistics, text summarization, discourse structure in scholarly discourse, paraphrase, textual entailment and text simplification.

The task constitutes automatic scientific paper summarization in the Computational Linguistics (CL) domain. The output summaries will be of two types: faceted summaries of the traditional self-summary (the abstract) and the community summary (the collection of citation sentences, or "citances"). We also propose to group the citances by the facets of the text that they refer to.

At SIGIR 2019, we will hold the 5th Computational Linguistics (CL) Scientific Summarization Shared Task, CL-SciSumm 2019, which is sponsored by SRI International and the Chan-Zuckerberg Initiative (CZI). This task follows up on the successful CL-SciSumm series held since 2016 and a pilot task at TAC 2014.



Important Dates

Submissions: May 3, 2019 / May 15, 2019
Notification: June 18, 2019 / May 31, 2019
Camera Ready Contributions: July 7, 2019
Workshop: July 25, 2019

Check the CL-SciSumm 2019 Shared Task homepage for details on dates with respect to the shared task. The dates are coordinated.

All deadlines for the BIRNDL workshop are calculated as 11:59pm Baker Island Time (BIT: UTC/GMT-12).
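
As a convenience, the deadline rule above can be converted to UTC with a few lines of Python. This is only a minimal sketch: it assumes Baker Island Time is a fixed UTC-12 offset, as stated, and the helper name `deadline_utc` is hypothetical and not part of any workshop tooling.

```python
from datetime import datetime, timedelta, timezone

# Baker Island Time as stated in the call: a fixed offset of UTC-12.
BIT = timezone(timedelta(hours=-12), name="BIT")


def deadline_utc(year: int, month: int, day: int) -> datetime:
    """Return 11:59pm BIT on the given date, expressed in UTC (hypothetical helper)."""
    local_deadline = datetime(year, month, day, 23, 59, tzinfo=BIT)
    return local_deadline.astimezone(timezone.utc)


# Example using the camera-ready date listed under Important Dates (July 7, 2019).
camera_ready = deadline_utc(2019, 7, 7)
print(camera_ready.isoformat())  # 2019-07-08T11:59:00+00:00
```

In other words, a deadline dated July 7 in BIT closes at 11:59 UTC on July 8; calling `.astimezone()` with no argument on the result should give the same instant in the local time zone of the machine running the script.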



Invited Talks

Adobe Distinguished Keynote:

Prof. Bonnie Webber, Univ. of Edinburgh
Talk Title: TBD

BIO: Bonnie Webber received her PhD from Harvard University and taught at the University of Pennsylvania in Philadelphia for 20 years before joining the School of Informatics at the University of Edinburgh, where she is now professor emeritus. Known for her research on discourse anaphora and discourse relations, she has served as President of the Association for Computational Linguistics (ACL) and Deputy Chair of the European COST action IS1312, "TextLink: Structuring Discourse in Multilingual Europe". Along with Aravind Joshi, Rashmi Prasad, Alan Lee and Eleni Miltsakaki, she is co-developer of the Penn Discourse TreeBank (both the 2008 Version 2 release and the about-to-be-released Version 3). She is a Fellow of the Association for the Advancement of Artificial Intelligence (AAAI), the Association for Computational Linguistics (ACL) and the Royal Society of Edinburgh (RSE), where she is also convenor of the Research Awards Committee. She works towards promoting women to more prominent positions in the NLP community and in Science and Technology more generally.

Alex Wade, Meta @ Chan-Zuckerberg Initiative (CZI)
Talk Title: Personalized feed/query-formulation, predictive impact, and ranking

Abstract: The Chan Zuckerberg Initiative's Meta discovery system is designed to help the biomedical research community stay up to date with the latest and most important papers and preprints, through feeds and search. Meta can generate a personalized feed of newly published papers specific and relevant to each user's scientific interests by leveraging state of the art embeddings and clustering techniques. Meta further calculates an article-level predicted Eigenfactor which is used in ranking the papers within each feed. This talk will demonstrate the Meta application and will cover some of the recent bibliometric approaches to query formulation and ranking to improve retrieval of recently published academic publications.

BIO: Alex Wade currently works with the Chan Zuckerberg Initiative as technical program manager for Meta. Previously, Wade served as the Director for Scholarly Communication for Microsoft Research, focused on Microsoft Academic, a semantic knowledge graph of academic research publications, people, and institutions. During his career at Microsoft, Wade managed the corporate search and taxonomy management services and served as Senior Program Manager for Windows Search. Prior to joining Microsoft, he held Systems Librarian, Engineering Librarian, Philosophy Librarian, and technical library positions at the University of Washington, the University of Michigan, and the University of California, Berkeley. Wade holds a bachelor's degree in Philosophy from the University of California, Berkeley, and a Master of Librarianship degree from the University of Washington.




Submission Information

Regular Research paper track: All submissions must be written in English, following the Springer LNCS author guidelines (max. 6 pages for short and 12 pages for full papers; exclusive of unlimited pages for references) and should be submitted as PDF files to EasyChair. All submissions will be reviewed by at least two independent reviewers. Please note that at least one author per paper needs to register for the workshop and attend the workshop to present the work. In case of a no-show, the paper (even if accepted) will be deleted from the proceedings and from the program. Submissions and reviewing will be managed by the EasyChair conference management system.

Poster track: We welcome submissions detailing original, early findings, works in progress and industrial applications of bibliometrics and IR for a special poster session, possibly with a 2-minute presentation in the main session. Some research track papers will also be invited to the poster track instead, although there will be no difference in the final proceedings between poster and research track submissions. These papers should follow the same format as the research track papers.

Shared Task: Teams that wish to participate in the CL Shared Task track at BIRNDL 2019 are invited to register on EasyChair by April 15th with a title and a tentative abstract describing their approach. Participants are advised to register as soon as possible in order to receive timely access to evaluation resources, including development and testing data. Registration for the task does not commit you to participation, but it helps the organizers with planning. All participants who submit system runs are welcome to present their system at the BIRNDL Workshop in the poster session, while the best performing system will be invited to present their paper in the main session. Dissemination of CL-SciSumm work and results other than in the workshop proceedings is welcomed, but the conditions of participation specifically preclude any advertising claims based on these results. Any questions about conference participation may be sent to the organizers mentioned below.

Submit a paper

Workshop proceedings will be deposited online in the CEUR workshop proceedings publication service (ISSN 1613-0073) and on the ACL Anthology. This way, the proceedings will be permanently available and citable (digital persistent identifiers and long-term preservation).




Main Organising Committee

Muthu Kumar Chandrasekaran - muthu.chandrasekaran@sri.com

Muthu Kumar Chandrasekaran is an Advanced Computer Scientist, Machine Learning, at SRI International's Artificial Intelligence Center. Previously he was a Ph.D. student at NUS School of Computing. He is broadly interested in natural language processing, machine learning and their applications to information retrieval; specifically, in retrieving and organising information from asynchronous conversation media such as scholarly publications and discussion forums. He has been co-organizing the CL-SciSumm Shared Task series and the BIRNDL workshop series since 2014. He also reviews for the ACL, EMNLP, NAACL and JCDL conferences. During his PhD he also spent time at the Allen Institute for Artificial Intelligence's Semantic Scholar research and the National Institute of Informatics, Tokyo.

Philipp Mayr - philipp.mayr@gesis.org

Philipp Mayr is a deputy department head and a team leader at GESIS - Leibniz Institute for the Social Sciences, in the department Knowledge Technologies for the Social Sciences (WTS). He was a visiting professor for knowledge representation at the University of Applied Sciences in Darmstadt, Department of Information Science and Engineering, during 2009-2011. Philipp Mayr received his PhD in applied informetrics and information retrieval from the Berlin School of Library and Information Science at Humboldt University Berlin in 2009. To date, he has been awarded substantial research funding (PI, Co-PI) from national and European funding agencies. Philipp Mayr has published in top conferences and prestigious journals in the areas of informetrics, information retrieval and digital libraries. His research group focuses on methods and techniques for interactive information retrieval. Philipp Mayr was the main organizer of the Combining Bibliometrics and Information Retrieval workshop at ISSI 2013, the BIR workshops at ECIR 2014, 2015 and 2016 and the first BIRNDL workshop at JCDL 2016.

Dayne Freitag - freitag@ai.sri.com

Dayne is the director of the Advanced Analytics group in SRI's Artificial Intelligence Center. His research seeks to apply artificial intelligence to information assimilation, management and exploitation. Freitag has served as principal investigator for a number of research projects including several large, multi-institutional efforts. His research goals have focused on the automation of data science; the automatic extension of mechanistic models through machine reading; knowledge federation over diverse information sources through data analytics and natural language processing; explaining the spread of ideas through online communities; and novel approaches to institutional knowledge management using controlled English. Freitag holds a B.A. in English literature from Reed College, and a Ph.D. in computer science from Carnegie Mellon University.

Min-Yen Kan - kanmy@comp.nus.edu.sg

Min-Yen Kan is an Associate Professor at the School of Computing, National University of Singapore. His research interests fall under the areas of digital libraries, natural language processing, information retrieval and human-computer interaction. Specifically, they include document structure acquisition, verb analysis, digital library resource annotation and applied text summarization. His research goal is to investigate how natural language processing and information retrieval can be applied to improve scholarly publication and knowledge discovery.

The main organizers will be supported by our previous co-organizers:



Programme Committee

The following committee members support the workshop series and will form our reviewer pool:




Related Workshops by Organisers of BIRNDL

Related Workshops by others