Dietz, Laura
Associate Professor, Computer Science

Member, College of Engineering and Physical Sciences (CEPS), University of New Hampshire
Member, Tenure Faculty (CEPS), College of Engineering and Physical Sciences (CEPS)
Member, Computer Science, College of Engineering and Physical Sciences (CEPS)

Research Areas

Computer Science
SCIENCE & TECHNOLOGY/MATHEMATICS/COMPUTER SCIENCE

Overview

My research combines methods for text retrieval, extraction, machine learning and analytics (TREMA).

Currently, I am working on methods that automatically, and in a query-driven manner, retrieve materials from the Web and compose Wikipedia-like articles. Especially for information needs about which the user has very little prior knowledge, the web search paradigm of 10 blue hyperlinks is not sufficient. Instead, I envision providing a synthesis of the Web materials that gives a comprehensive overview (TREC CAR).

My goal is to develop algorithms that find what users are looking for based on text content only. In contrast, most Web-search algorithms are based on interaction data such as query-log, click, or session information, which is not available when searching private document collections. Consequently, we aim to maximize the utility of information retrieval models in combination with methods from natural language processing.

A particular emphasis of my work is to utilize information from structured knowledge bases such as Wikipedia, Freebase, or DBpedia together with text-based reasoning on general document and Web corpora (KG4IR). In my work on "Entity Query Feature Expansion" (SIGIR 2014), I demonstrate that significantly better search results are obtained when using entity linking and knowledge bases in the retrieval algorithm.

Selected Publications

Academic Article

Year	Title
2020	Toward comprehensive event collections. International Journal on Digital Libraries. 21:215-229. 2020
2019	Special issue on knowledge graphs and semantics in text analysis and retrieval. Information retrieval (Boston). 22:229-231. 2019
2019	Special issue on knowledge graphs and semantics in text analysis and retrieval. Information Retrieval Journal. 1-3. 2019
2018	Knowledge-rich image gist understanding beyond literal meaning. Data & Knowledge Engineering. 117:114-132. 2018
2018	Toward a computational history of universities: Evaluating text mining methods for interdisciplinarity detection from PhD dissertation abstracts. Digital Scholarship in the Humanities. 33:612-620. 2018
2018	ACM SIGIR Student Liaison Program. ACM SIGIR Forum. 51:42-45. 2018
2018	Overview of The First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR). ACM SIGIR Forum. 51:139-144. 2018
2018	Toward comprehensive event collections. International Journal on Digital Libraries. 1-15. 2018
2018	Understanding the Gist of Images-Ranking of Concepts for Multimedia Indexing. arXiv preprint arXiv:1809.08593. 2018
2017	Data from the paper: Towards a Computational History of Universities: Evaluating Text Mining Methods for Interdisciplinarity Detection from Ph. D. Dissertation Abstracts. Digital Scholarship in the Humanities. 2017
2016	Capturing interdisciplinarity in academic abstracts. D-Lib Magazine. 22:9-9. 2016
2016	Enhancing domain-specific entity linking in DH. Computational Linguistics. 2:67-88. 2016
2011	Inferring functional modules of protein families with probabilistic topic models. BMC Bioinformatics. 12:141-141. 2011
2010	Directed factor graph notation for generative models. Max Planck Institute for Informatics, Tech. Rep. 2010
2006	Exploring Social Topic Networks with the Author-Topic Model. Proceedings of ESWC’06. 54-60. 2006
2003	An Ubiquitous and Multimedia Environment for Education (EUME). Learning Technology. 5. 2003
	Across-Document Neighborhood Expansion for Candidate Retrieval
	CCR@ TREC 2012 KBA
	Stimulating Massively Multiplayer Cooperation with Co-located Game Concepts. PerGames.
	Supplementary Material for "Localizing Bugs in Program Executions with Graphical Models"

Article

Year	Title
2024	LLM-based relevance assessment still can't replace human relevance assessment 2024
2024	A Workbench for Autograding Retrieve/Generate Systems 2024
2024	An Exam-based Evaluation Approach Beyond Traditional Relevance Judgments 2024
2023	Retrieve-Cluster-Summarize: An Alternative to End-to-End Training for Query-specific Article Generation 2023
2023	Perspectives on Large Language Models for Relevance Judgment 2023

Chapter

Year	Title
2023	ECIR 23 Tutorial: Neuro-Symbolic Approaches for Information Retrieval. Lecture Notes in Computer Science. 324-330. 2023
2023	Entity Embeddings for Entity Ranking: A Replicability Study. Lecture Notes in Computer Science. 117-131. 2023
2005	Cooperation in ubiquitous computing: an extended view on sharing. 241-250. 2005

Conference Paper

Year	Title
2024	Pencils Down! Automatic Rubric-based Evaluation of Retrieve/Generate Systems. Proceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval. 175-184. 2024
2024	A Workbench for Autograding Retrieve/Generate Systems. Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1963-1972. 2024
2023	Perspectives on Large Language Models for Relevance Judgment. Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval. 39-50. 2023
2023	Neuro-Symbolic Representations for Information Retrieval. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 3436-3439. 2023
2022	Topic-Mono-BERT: A Joint Retrieval-Clustering System for Retrieving Overview Passages. Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation. 54-59. 2022
2022	Query-specific Subtopic Clustering. 2022 ACM/IEEE Joint Conference on Digital Libraries (JCDL). 1-9. 2022
2021	Report on the first hipstir workshop on the future of information retrieval. ACM SIGIR Forum. 62-75. 2021
2020	A Large Test Collection for Entity Aspect Linking. Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 3109-3116. 2020
2020	Alligator collector: a latency-optimized garbage collector for functional programming languages. Proceedings of the 2020 ACM SIGPLAN International Symposium on Memory Management. 87-99. 2020
2019	An Analysis of Deep Contextual Word Embeddings and Neural Architectures for Toponym Mention Detection in Scientific Publications. NAACL Workshop on extracting structured knowledge from scientific publications (ESSP). 2019
2019	Local and Global Query Expansion for Hierarchical Complex Topics. ECIR 2019 European Conference on Information Retrieval. 2019
2019	Special issue on knowledge graphs and semantics in text analysis and retrieval 2019
2019	UNH at SemEval-2019 Task 12: Toponym Resolution in Scientific Papers. The 2019 International Workshop on Semantic Evaluation colocated with NAACL (SemEval). 2019
2019	Why does this Entity matter? Support Passage Retrieval for Entity Retrieval. Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval (ICTIR'19). 220-223. 2019
2018	Entity-Aspect Linking: Providing Fine-Grained Semantics of Entities in Context. JCDL'18: Proceedings of the 18th ACM/IEEE Joint Conference on Digital Libraries. 49-58. 2018
2018	TREC Complex Answer Retrieval Overview. Proceedings of TREC. 2018
2018	TREMA-UNH at TREC 2018: Complex Answer Retrieval and News Track. Text Retrieval Conference (TREC). 2018
2018	The Second Workshop on Knowledge Graphs and Semantics for Text Retrieval, Analysis, and Understanding (KG4IR). The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 1423-1426. 2018
2018	UKParl: A Data Set for Topic Detection with Semantically Annotated Text 2018
2018	Utilizing Knowledge Graphs for Text-Centric Information Retrieval. ACM/SIGIR Proceedings 2018. 1387-1390. 2018
2018	WordNetContext: Information Retrieval-friendly Access to WordNet Senses. ProfS/KG4IR/Data: Search@ SIGIR. 63-64. 2018
2017	Benchmark for Complex Answer Retrieval. ICTIR ’17 Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval. 293-296. 2017
2017	Building Entity-Centric Event Collections. Proceedings of the 17th ACM/IEEE Joint Conference on Digital Libraries (JCDL). 199-208. 2017
2017	Building Entity-Centric Event Collections For Supporting Research in Political and Social History. Digital Humanities. 2017
2017	Open Relation Extraction for Support Passage Retrieval: Merit and Open Issues. Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1149-1152. 2017
2017	TREC Complex Answer Retrieval overview. Proceedings of TREC. 2017
2017	TREMA-UNH at TREC 2017: Complex Answer Retrieval. Text Retrieval Conference (TREC). 2017
2017	The First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR). Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1427-1428. 2017
2017	Using Object Detection, NLP, and Knowledge Bases to Understand the Message of Images. Lecture Notes in Computer Science. 405-418. 2017
2017	Utilizing Knowledge Graphs in Text-centric Information Retrieval. Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. 815-816. 2017
2017	Women in IR. ACM SIGIR Forum. 15-17. 2017
2016	Entity Relatedness for Retrospective Analyses of Global Events. NLP+CSS: Workshops on Natural Language Processing and Computational Social Science at Conference of Web Science 2016 May 22, 2016, Hannover, Germany. 2016
2016	Finding Relevant Relations in Relevant Documents. Advances in Information Retrieval. 38th European Conference on IR Research, ECIR 2016, Padua, Italy, March 20–23, 2016. 654-660. 2016
2016	Finding Relevant Relations in Relevant Documents. Advances in Information Retrieval: 38th European Conference on IR Research, ECIR 2016, Padua, Italy, March 20–23, 2016. Proceedings. 654-660. 2016
2016	Topic model tutorial: A basic introduction on latent dirichlet allocation and extensions for web scientists. Proceedings of the 8th ACM Conference on Web Science. 10-10. 2016
2016	Tutorial on Utilizing knowledge bases in text-centric information retrieval. Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval. 5-5. 2016
2016	Understanding the message of images with knowledge base traversals. Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval. 199-208. 2016
2015	An interface sketch for queripidia: Query-driven knowledge portfolios from the web. Proceedings of the Eighth Workshop on Exploiting Semantic Annotations in Information Retrieval. 43-46. 2015
2015	Image with a Message: Towards detecting non-literal image usages by visual linking 2015
2015	Ranking Entities for Web Queries Through Text and Knowledge. ACM International Conference on Information and Knowledge Management (CIKM). 2015
2015	UMass at TREC WEB 2014: Entity Query Feature Expansion using Knowledge Base Links. Text Retrieval Conference (2015). 2015
2014	Entity query feature expansion using knowledge base links. Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval. 365-374. 2014
2014	Queripidia: Query-specific Wikipedia Construction. NIPS Workshop for Automated Knowledge Base Construction (AKBC). 2014
2014	UMass CIIR at TAC KBP 2013 Entity Linking: Query Expansion using Urban Dictionary. Text Analysis Conference (TAC). 2014
2014	UMass at BioASQ 2014: Figure-inspired Text Retrieval. CLEF (Working Notes). 1296-1310. 2014
2014	UMass at TREC 2013 Knowledge Base Acceleration Track. Text Retrieval Conference (TREC). 2014
2013	A neighborhood relevance model for entity linking. Proceedings of the 10th Conference on Open Research Areas in Information Retrieval. 149-156. 2013
2013	Constructing query-specific knowledge bases. Proceedings of the 2013 workshop on Automated knowledge base construction. 55-60. 2013
2013	Retrieving opinions from discussion forums. Proceedings of the 22nd ACM international conference on Information & Knowledge Management. 1225-1228. 2013
2013	Time-aware evaluation of cumulative citation recommendation systems. Proceedings of the SIGIR 2013 workshop on time-aware information access. 2013
2013	UMass CIIR at TAC KBP 2013 Entity Linking. Proc. Text Analysis Conference (TAC2013). 2013
2013	UMass at TREC 2013 Knowledge Base Acceleration Track: Bi-directional Entity Linking and Time-aware Evaluation. Text Retrieval Conference (TREC). 2013
2012	Across-document neighborhood expansion: UMass at TAC KBP 2012 entity linking. Text Analysis Conference (TAC). 2012
2012	Bi-directional linkability from Wikipedia to documents and back again: UMass at TREC 2012 knowledge base acceleration track. Text Retrieval Conference. 2012
2012	De-Layering Social Networks by Shared Tastes of Friendships. International Conference on Weblogs and Social Media (ICWSM). 2012
2010	Inferring Shared Interests from Social Networks. Proceedings of Neural Information Processing Systems Workshop on Computational Social Science and the Wisdom of Crowds. 2010
2009	Localizing bugs in program executions with graphical models. Advances in Neural Information Processing Systems. 468-476. 2009
2009	Modeling shared tastes in online communities. NIPS Workshop on Applications for Topic Models: Text and Beyond. 2009
2008	Probabilistic Graph Models for Debugging Software. NIPS Workshop on Analyzing Graphs: Theory and Applications. 2008
2007	Modeling Evolution of Ideas in the Web of Science. NIPS 2007. 1-2. 2007
2007	Unsupervised prediction of citation influences. Proceedings of the 24th international conference on Machine learning. 233-240. 2007
2007	of Proceedings: ICML 2007: proceedings of the Twenty-Fourth International Conference on Machine Learning 2007
2006	Utilize probabilistic topic models to enrich knowledge bases. Proc. of the ESWC 2006 Workshop on Mastering the Gap: From Information Extraction to Semantic Representation. 2006
2004	Single display gaming: Examining collaborative games for multi-user tabletops. Workshop on Gaming Applications in Pervasive Computing Environments at Pervasive. 2004
2003	ConcertStudeo: Using PDAs to support face-to-face learning. International Conference on Computer Support for Collaborative Learning. 235-237. 2003
	Proceedings of the First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR 2017) co-located with the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017), Shinjuku, Tokyo, Japan, August 11, 2017. CEUR Workshop Proceedings. Ed. Dietz, Laura.
	Workshop on Knowledge Graph Technology and Applications (KGTA). WWW 2019: International World Wide Web Conference.
	Workshop on extracting structured knowledge from scientific publications (ESSP). NAACL 2019 Annual Conference of the North American Chapter of the Association of Computational Linguistics.

Editor Of

Proceedings of the First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR 2017) co-located with the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017), Shinjuku, Tokyo, Japan, August 11, 2017. CEUR Workshop Proceedings.

Principal Investigator On

Fine-grained Knowledge awarded by National Science Foundation (NSF) 2019 - 2025

CAREER: Utilizing Fine-grained Knowledge Annotations in Text Understanding and Retrieval awarded by National Science Foundation (NSF) 2019 - 2023

Forecasting Salinity in Rivers During Storm Events awarded by Columbia University 2020 - 2021

Teaching Activities

Algorithms Taught course

DS - Knowledge Graphs and Text Taught course

Doctoral Research Taught course

Information Retrieval Taught course

Doctoral Research Taught course 2024

Foundations of Neural Networks Taught course 2024

Doctoral Research Taught course 2023

Doctoral Research Taught course 2022

Foundations of Neural Networks Taught course 2022

Algorithms Taught course 2022

DS - Knowledge Graphs and Text Taught course 2022

Doctoral Research Taught course 2022

Independent Study Taught course 2022

Doctoral Research Taught course 2021

Information Retrieval Taught course 2021

Internship Experience Taught course 2021

Algorithms Taught course 2021

DS - Knowledge Graphs and Text Taught course 2021

Doctoral Research Taught course 2021

Doctoral Research Taught course 2020

Top/Machine Learn for Sequences Taught course 2020

Algorithms Taught course 2020

DS - Knowledge Graphs and Text Taught course 2020

Doctoral Research Taught course 2020

Doctoral Research Taught course 2019

Information Retrieval Taught course 2019

DS - Knowledge Graphs and Text Taught course 2019

Doctoral Research Taught course 2019

Information Retrieval Taught course 2018

Adv Top/Data Sci w/ KnowGraphs Taught course 2018

Independent Study Taught course 2017

Information Retrieval Taught course 2017

Adv Top/Data Science Taught course 2017

Top/Information Retrieval Taught course 2016

Education And Training

B.S., Goethe University, Germany

M.S., Goethe University, Germany

Ph.D. Computer Science, Max Planck Institute

Full Name

Laura Dietz
