my NLP & AI Research

In the following you will find a collection of articles, research papers, and thought-provoking pieces on natural language processing and machine learning. My publications focus on text-generation, paraphrasing, and the social and ethical implications of these technologies. For statistics about the papers, check out Google Scholar or Semantic Scholar. Consider visiting the blog for thoughts beyond the research works.

MAGPIE: Multi-Task Media-Bias Analysis of Generalization of Pre-Trained Identification of Expressions
LREC-COLING 2024
Tomáš Horych, Martin Wessel, Jan Philip Wahle, Terry Ruas, Jerome Waßmuth, André Greiner-Petter, Akiko Aizawa, Bela Gipp, Timo Spinde
[pdf] [bibtex] [code]
Text-Guided Image Clustering
EACL 2024 (Oral)
Andreas Stephan, Lukas Miklautz, Kevin Sidak, Jan Philip Wahle, Bela Gipp, Claudia Plant, Benjamin Roth
[pdf] [bibtex] [code]
Paraphrase Types for Generation and Detection
EMNLP 2023
Jan Philip Wahle, Bela Gipp, Terry Ruas
[pdf] [bibtex] [code] [demo]
We are Who We Cite: Bridges of Influence Between NLP and Other Academic Fields
EMNLP 2023 (Oral)
Jan Philip Wahle, Terry Ruas, Mohamed Abdalla, Bela Gipp, Saif M. Mohammad
[pdf] [bibtex] [code] [demo]
The Elephant in the Room: Analyzing the Presence of Big Tech in NLP Research
ACL 2023 (Oral)
Mohamed Abdalla*, Jan Philip Wahle*, Terry Ruas, Aurelie Névéol, Fanny Ducel, Saif M. Mohammad, Karen Fort
[pdf] [bibtex] [code]
How Large Language Models are Transforming Machine-Paraphrase Plagiarism
EMNLP 2022 (Oral)
Jan Philip Wahle, Terry Ruas, Frederic Kirstein, Bela Gipp
[pdf] [bibtex] [code]
Analyzing Multi-Task Learning for Abstractive Text Summarization
EMNLP-GEM 2022
Frederic Kirstein, Jan Philip Wahle, Terry Ruas, Bela Gipp
[pdf] [bibtex] [code]
D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research
LREC 2022 (Oral)
Jan Philip Wahle, Terry Ruas, Saif Mohammad, Bela Gipp
[pdf] [bibtex] [code]
Identifying Machine-Paraphrased Plagiarism
iConference 2022
Jan Philip Wahle, Terry Ruas, Tomas Foltýnek, Norman Meuschke, Bela Gipp
[pdf] [bibtex] [code]
Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection
iConference 2022
Jan Philip Wahle*, Nischal Ashok*, Terry Ruas, Norman Meuschke, Tirthankar Ghosal, Bela Gipp
[pdf] [bibtex] [code]
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection
JCDL 2021
Jan Philip Wahle, Terry Ruas, Norman Meuschke, Bela Gipp