Dr. Jan Philip Wahle

Dr. Jan Philip Wahle is a faculty member at the University of Göttingen. He received his PhD in Computer Science from the University of Göttingen and has been a visiting researcher at the National Research Council Canada (NRC) working with Dr. Saif M. Mohammad. Before his PhD, he worked as a software engineer for the autonomous driving company developing machine learning models and annotation tools for object detection and tracking at Aptiv PLC. His main research interests lie in machine learning and natural language processing with a focus on reasoning methods via reinforcement learning and AI safety via interpretability. His research has been presented at various conferences, including ACL and EMNLP, and won the ACL Best Resource Paper Award and the SemEval Best Task Award.

News

Here are updates about recent activities, such as talks, awards, travels, and workshops.

Jul 2026: ✈️ I will travel to Korea to present our progress on the fake news detection grant with the Korean National Police Agency (KNPA) and the Korea Institute of Police Technology (KIPoT).

Jun 2026: 💰 I received the CIDAS Fellowship from the Campus Institute Data Science (CIDAS) to work on measuring the faithfulness and safety of large reasoning models.

May 2026: ✈️ I will attend LREC in Palma de Mallorca to present our recent works.

May 2026: 🎤 I will give a talk at the University of Stuttgart on how the field of AI and NLP has evolved.

Mar 2026: 🎤 I gave a keynote on LibraryAI at the Library AI-Conference 2026 in Frankfurt.

Nov 2025: 🎓 I defended my PhD thesis.

Nov 2025: ✈️ I will attend EMNLP in Suzhou, China, to present our recent works on privacy leaking, spatial reasoning, and multi-agent debate.

Oct 2025: ✈️ I will attend the Korean Police Expo (KPEX) in Incheon to present our recent work on fake news generation and detection.

Oct 2025: 🎤 I will give a talk at the Cardiff NLP seminar for Nedjma Ousidhoum's group.

Sep 2025: 🎤 I will give a talk at the University of Zurich for Sarah Ebling's group.

Jul 2025: 🏆 We won the ACL Best Resource Paper Award and SemEval Best Task Paper Award for our work on emotion detection.

Jul 2025: 💰 The AI in Museums (MWK), EDIKILEX (MWK), and COPILOT (BMWK) projects got funded. Press Release.

Jun 2025: 💰 The Paraphrase Types project got funded by the DFG. Press Release.

May 2025: ✈️ I will attend ACL in Vienna to present our recent works on multi‑agent debate, meeting transcript synthesis, and multi‑lingual emotion recognition.

Apr 2025: 💰 The Fake News Detection project with the Korean National Police Agency (KNPA) got funded. Press Release.

Jan 2025: ✈️ I will attend COLING in Abu Dhabi to present our recent works on paraphrase types and recency bias.

Nov 2024: ✈️ I will attend EMNLP in Miami to present our recent works on prompt engineering and summarization.

Jul 2024: 🎤 I will give a talk to the Volkswagen Foundation about AI tools in the funding process.

Apr 2024: 🎤 I will give a talk about NLP innovations for businesses to the company Eschbach.

Mar 2024: ✈️ I am visiting the group of Benjamin Roth in Vienna.

Feb 2024: 🎤 I will give a talk at the Jantina Tammes School of Digital Society, Technology and AI organized by Tommaso Caselli about Insights, Findings, and Recommendations for Paraphrasing and Plagiarism in the Age of LLMs.

Dec 2023: ✈️ I will attend EMNLP in Singapore to present our works on paraphrase types and cross‑field citation dynamics.

Nov 2023: 🎤 I will give a talk at MaiNLP at LMU Munich to Barbara Plank's group about our EMNLP paper on cross‑field citation dynamics.

Jul 2023: ✈️ I will attend ACL in Toronto to present our work on big tech influence on NLP research.

Apr 2023: 🎓 I got awarded a six‑month scholarship by the DAAD to visit the National Research Council Canada and work with Saif M. Mohammad.

Mar 2023: 🎤 I will give a talk at the Open Science Workshop organized by Birgit Schmidt about AI in the scientific writing process

Dec 2022: ✈️ I will attend EMNLP in Abu Dhabi to present our work on how large language models are transforming plagiarism detection.

Publications

My main research interests during my PhD were in language modeling and understanding via paraphrase generation and detection. After my PhD, my main research interests have shifted to reasoning methods via reinforcement learning and AI safety via interpretability. You can also find all my publications on Google Scholar .

...

Papers

...

Citations

...

h-index

...

i10-index

ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents

ACL 2026 (Main)

Tianyu Yang, Terry Ruas, Yijun Tian, Jan Philip Wahle, Daniel Kurzawe, and Bela Gipp

[pdf] [bibtex]

DimABSA: Building Multilingual and Multidomain Datasets for Dimensional Aspect-Based Sentiment Analysis

ACL 2026 (Main)

Lung-Hao Lee, Liang-Chih Yu, Natalia Loukashevich, Ilseyar Alimova, Alexander Panchenko, Tzu-Mi Lin, Zhe-Yu Xu, Jian-Yu Zhou, Guangmin Zheng, Jin Wang, Sharanya Awasthi, Jonas Becker, Jan Philip Wahle, Terry Ruas, Shamsuddeen Hassan Muhammad, and Saif M. Mohammad

[pdf] [bibtex]

Who Watches the Watchmen? Humans Disagree With Translation Metrics on Unseen Domains

ACL 2026 (Findings)

Finn Schmidt, Jan Philip Wahle, Terry Ruas, and Bela Gipp

[pdf] [bibtex]

Affect, Body, Cognition, Demographics, and Emotion: The ABCDE of Text Features for Computational Affective Science

LREC 2026 (CAS)

Jan Philip Wahle, Krishnapriya Vishnubhotla, Bela Gipp, and Saif M. Mohammad

[pdf] [bibtex] [code] [data]

Piecing Together Cross-Document Coreference Resolution Datasets: Systematic Dataset Analysis and Unification

LREC 2026

Anastasia Zhukova, Terry Ruas, Jan Philip Wahle, and Bela Gipp

[pdf] [bibtex] [code] [data]

Stay Focused: Problem Drift in Multi-Agent Debate

EACL 2026 (Findings)

Jonas Becker, Lars Benedikt Kaesberg, Andreas Stephan, Jan Philip Wahle, Terry Ruas, and Bela Gipp

[pdf] [bibtex] [code]

Language Modeling and Understanding Through Paraphrase Generation and Detection

Dissertation 2025

Jan Philip Wahle

[pdf] [bibtex]

TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent

EMNLP 2025 (Main)

Dominik Meier, Jan Philip Wahle, Paul Röttger, Terry Ruas, and Bela Gipp

[pdf] [bibtex] [code] [data] [model]

SPaRC: A Spatial Pathfinding Reasoning Challenge

EMNLP 2025 (Main)

Lars Kaesberg, Jan Philip Wahle, Terry Ruas, and Bela Gipp

[pdf] [bibtex] [code] [data] [webpage]

MALLM: Multi-agent large language models framework

EMNLP 2025 (Demo)

Jonas Becker, Lars Benedikt Kaesberg, Niklas Bauer, Jan Philip Wahle, Terry Ruas, and Bela Gipp

[pdf] [bibtex] [demo]

Voting or Consensus? Decision-Making in Multi-Agent Debate

ACL 2025 (Findings)

Lars Benedikt Kaesberg, Jonas Becker, Jan Philip Wahle, Terry Ruas, and Bela Gipp

[pdf] [bibtex] [code]

You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with Multi-Agent Conversations

ACL 2025 (Findings)

Frederic Kirstein, Muneeb Khan, Jan Philip Wahle, Terry Ruas, and Bela Gipp

[pdf] [bibtex] [code+data]

BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages

ACL 2025 (Main), Best Resource Paper Award 🏆

Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, Jan Philip Wahle, Terry Ruas, Meriem Beloucif, Christine de Kock, Nirmal Surange, Daniela Teodorescu, Ibrahim Said Ahmad, and others

[pdf] [bibtex] [webpage] [data] [code]

SemEval-2025 task 11: Bridging the Gap in Text-Based Emotion Detection

SemEval @ ACL 2025, Best Task Award 🏆

Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, Seid Muhie Yimam, Jan Philip Wahle, Terry Ruas, Meriem Beloucif, Christine de Kock, Tadesse Destaw Belay, Ibrahim Said Ahmad, and others

[pdf] [bibtex]

Citation Amnesia: On The Recency Bias of Natural Language Processing and Other Academic Fields

COLING 2025

Jan Philip Wahle, Terry Ruas, Mohamed Abdalla, Bela Gipp, and Saif Mohammad

[pdf] [bibtex] [code] [poster] [demo]

Towards Human Understanding of Paraphrase Types in Language Models

COLING 2025

Dominik Meier, Jan Philip Wahle, Terry Ruas, and Bela Gipp

[pdf] [bibtex] [data] [code]

CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization

JAIR 2025

Frederic Kirstein, Jan Philip Wahle, Bela Gipp, Terry Ruas

[pdf] [bibtex]

Overview of the Generated Plagiarism Detection Task at PAN 2025

CLEF 2025 (Working Notes)

André Greiner-Petter, Maik Fröbe, Jan Philip Wahle, Terry Ruas, Bela Gipp, Akiko Aizawa, and Martin Potthast

[pdf] [bibtex] [task]

Paraphrase Types Elicit Prompt Engineering Capabilities

EMNLP 2024 (Main)

Jan Philip Wahle, Terry Ruas, Yang Xu, and Bela Gipp

[pdf] [bibtex] [code] [talk]

What's under the hood: Investigating Automatic Metrics on Meeting Summarization

EMNLP 2024 (Findings)

Frederic Kirstein, Jan Philip Wahle, Terry Ruas, and Bela Gipp

[pdf] [bibtex]

CiteAssist: A System for Automated Preprint Citation and BibTeX Generation

SDProc @ ACL 2024

Lars Kaesberg, Terry Ruas, Jan Philip Wahle, Bela Gipp

[pdf] [bibtex] [code] [demo]

MAGPIE: Multi-Task Media-Bias Analysis of Generalization of Pre-Trained Identification of Expressions

LREC-COLING 2024

Tomáš Horych, Martin Wessel, Jan Philip Wahle, Terry Ruas, Jerome Waßmuth, André Greiner-Petter, Akiko Aizawa, Bela Gipp, Timo Spinde

[pdf] [bibtex] [code]

Text-Guided Image Clustering

EACL 2024 (Main)

Andreas Stephan, Lukas Miklautz, Kevin Sidak, Jan Philip Wahle, Bela Gipp, Claudia Plant, Benjamin Roth

[pdf] [bibtex] [code] [talk]

Paraphrase Types for Generation and Detection

EMNLP 2023 (Main)

Jan Philip Wahle, Bela Gipp, Terry Ruas

[pdf] [bibtex] [code] [demo] [talk]

We are Who We Cite: Bridges of Influence Between NLP and Other Academic Fields

EMNLP 2023 (Main)

Jan Philip Wahle, Terry Ruas, Mohamed Abdalla, Bela Gipp, Saif M. Mohammad

[pdf] [bibtex] [code] [demo] [blog] [talk]

AI Usage Cards: Responsibly Reporting AI-generated Content

JCDL 2023

Jan Philip Wahle, Terry Ruas, Saif M. Mohammad, Norman Meuschke, Bela Gipp

[pdf] [bibtex] [template] [webpage] [talk]

The Elephant in the Room: Analyzing the Presence of Big Tech in NLP Research

ACL 2023 (Main)

Mohamed Abdalla, Jan Philip Wahle, Terry Ruas, Aurelie Névéol, Fanny Ducel, Saif M. Mohammad, Karen Fort

[pdf] [bibtex] [code] [talk]

How Large Language Models are Transforming Machine-Paraphrase Plagiarism

EMNLP 2022 (Main)

Jan Philip Wahle, Terry Ruas, Frederic Kirstein, Bela Gipp

[pdf] [bibtex] [code] [data] [talk]

Analyzing Multi-Task Learning for Abstractive Text Summarization

GEM @ EMNLP 2022

Frederic Kirstein, Jan Philip Wahle, Terry Ruas, Bela Gipp

[pdf] [bibtex] [code]

D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research

LREC 2022 (Main)

Jan Philip Wahle, Terry Ruas, Saif Mohammad, Bela Gipp

[pdf] [bibtex] [code] [talk]

Identifying Machine-Paraphrased Plagiarism

iConference 2022

Jan Philip Wahle, Terry Ruas, Tomas Foltýnek, Norman Meuschke, Bela Gipp

[pdf] [bibtex] [code] [data] [talk]

Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

iConference 2022

Jan Philip Wahle, Nischal Ashok, Terry Ruas, Norman Meuschke, Tirthankar Ghosal, Bela Gipp

[pdf] [bibtex] [code] [talk]

Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection

JCDL 2021

Jan Philip Wahle, Terry Ruas, Norman Meuschke, Bela Gipp

[pdf] [bibtex] [code+data]

Third-Party Funded Projects

I am managing several third-party funded research projects focusing on natural language processing, multi-agent systems, AI safety, and low-resource AI.

Measuring the Faithfulness and Safety of Large Reasoning Models

Campus Institute Data Science (CIDAS)

Sep 2026 – Aug 2028

Paraphrase Types: A New Paradigm for Paraphrase Generation and Detection

German Research Foundation (DFG)

Feb 2026 – Feb 2029

AI in Museums

Lower Saxony Ministry for Science and Culture (MWK)

Jan 2026 – Dec 2027

EDIKILEX: AI and Lexicography for Early New High German Texts

Lower Saxony Ministry for Science and Culture (MWK)

Nov 2025 – Nov 2029

COPILOT: Multi-Agent Assistance System for Process Optimization and Root Cause Analysis in Process Industry

Federal Ministry for Economic Affairs and Climate Action (BMWK)

Aug 2025 – Nov 2027

Fake News Detection: A Manipulated Content Authenticity System

Korean National Police Agency (KNPA)

Apr 2025 – Dec 2026

Startups and Open-Source

I build products and open-source tools that turn research and everyday problems into useful software, spanning academic integrity, research workflows, and consumer apps.

Bloatless

Personal wellness app for iPhone that helps uncover patterns behind bloating through quick daily check-ins, meal logging, gentle insights, and an AI wellness coach while keeping wellness history on-device.

Check My Thesis

Academic integrity tool for students and researchers. Verifies citations against academic databases and detects AI-generated text at the sentence level before submission.

Swipe Photos

Photo management app for iPhone and Mac. Clear out duplicates, blurry shots, and clutter from your camera roll by swiping through photos to keep or delete.

AI Conference Deadlines

Track submission deadlines for AI, ML, and NLP conferences. Features countdown timers, calendar integration, and customizable notifications.

AI Usage Cards

Generate standardized reports for documenting AI assistance in scientific works. Promotes transparency and responsible AI use in research.

CiteAssist

Automated citation and BibTeX generator for preprints. Instantly create properly formatted citations from arXiv, bioRxiv, and other preprint servers.

Apple Mail AI Plugin

Free, open-source macOS plugin that brings Claude, GPT, and Gemini into Apple Mail. Turn rough notes into polished replies with API keys stored locally for full privacy.

Contact

Dr. Jan Philip Wahle
University of Göttingen
Papendiek 14, Office 0.209
37073 Göttingen, Germany
wahle {at} uni-goettingen {dot} de

Page last updated: July 31, 2026