Skip to main content
Back to King's College London homepage

The Natural Language Processing (NLP) Group at KCL is comprised of PhD and postdoctoral students, professors and others who are interested in solving computational problems related to the understanding of human language. This encompasses a wide range of topics including sentiment analysis, topic/event extraction, question answering, cross-modal retrieval, text illustration, social media analysis and many more, typically approached with machine learning.

All images have been generated using DALL-E.

People

Oana Cocarascu

Senior Lecturer in Artificial Intelligence

PhD Student

Bogdan Grecu

PhD student

Lin Gui

Lecturer in Natural Language Processing

Yulan He

Professor in Natural Language Processing

PhD Student

Projects

An AI-generated image of a network
Event-Centric Framework for Natural Language Understanding

The five-year UKRI-funded Turing AI Fellowship awarded to Yulan He aims to develop a machine reading comprehension model in which a computer could continuously build and update a graph of eventualities as reading progresses.

An AI-generated image of a woman
New Language Modelling

Lin Gui and Yulan He have been awarded a prestigious EPSRC New Horizons grant for a high-risk research project with potentially transformative impact. The project aims to develop a new language modelling method allowing for a more faithful and explainable approximation for the input text.

An AI-generated image of a hand holding a pencil above a notepad
Automated Scoring System for GCSE Science Exams

Funded by AQA, the project aims to develop an automated scoring system for assessing students’ answers to descriptive questions in GCSE Biology or Chemistry. The system is expected to produce prediction of marks and generate the rationales explaining the model decisions.

An AI-generated image of two robots approaching a woman reading a book
Character-Centric Narrative Understanding

The EPSRC ICASE project, jointly funded by Huawei London Research Centre, aims to develop new AI algorithms for automatic understanding of narratives in novels.

An AI-generated image of a human brain
Model Interpretability

In our EPSRC-funded project, “Twenty20Insight”, we aim to investigate explainable AI (XAI) approaches which can provide interpretations both faithful to model decisions and are also better understood by humans.

An AI-generated image of speech bubbles
PANACEA: PANdemic Ai Claim vEracity Assessment

Led by Yulan He, the EPSRC-funded PANACEA project developed novel supervised/unsupervised methods for veracity assessment of claims unverified at the time of posting, by integrating information from multiple sources and building a knowledge network that enables cross verification.

Activities

An AI-generated image of Earth floating in a bottle
Tutorial on Uncertainty Quantification for Text Classification in SIGIR 2023

Yulan He and Lin Gui from King's College London, together with Dell Zhang, Murat Sensoy, Masoud Makrehchi, Bilyana Taneva-Popova at Thomson Reuters, will deliver a tutorial on Uncertainty Quantification for Text Classification in SIGIR 2023 in July 2023.

An AI-generated image of people gathered in a circle
Invited Talk on Uncertainty Interpretation and Calibration of Language Models

Yulan He is invited to give a talk on uncertainty interpretation and calibration of language models in NLDB 2023 in June 2023.

An AI-generated image of a car and cityscape
Invited Talk on NLP Research to Drive FinTech

Yulan He gave an invited talk on “NLP Research to Drive FinTech: Now and Next” in the Gillmore Centre for Financial Technology at Warwick Business School in December 2022.

Events

08Nov

Long Narrative Understanding in the Era of Large Language Models

Prof Yulan He will present her group's latest research on Large Language Models

Please note: this event has passed.

Resources

Code & Dataset

    • NapSS - Paragraph-level Medical Text Simplification
NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization, Findings of EACL, May. 2023.
    • Bayesian-Trans - Extracting Event Temporal Relations
Event Temporal Relation Extraction with Bayesian Translational Model, 2023
    • dynamic Brand-Topic Model (dBTM) - models the evolution of the latent brand polarity scores and the topic-word distributions over time.
Tracking Brand-Associated Polarity-Bearing Topics in User Reviews, Transactions of the Association for Computational Linguistics, accepted.
    • PHEE - A Dataset for Pharmacovigilance Event Extraction from Text
PHEE: A Dataset for Pharmacovigilance Event Extraction from Text, The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), Dec. 2022.
    • TranCLR - Event-Centric Question Answering
Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation, Findings of EMNLP, Dec. 2022.
    • HINT - Hierarchical Interpretation of Neural Text Classification
Hierarchical Interpretation of Neural Text Classification, Computational Linguistics, to appear.
    • XProNet - Cross-modal Prototype Driven Network for Radiology Report Generation
Cross-modal Prototype Driven Network for Radiology Report Generation, 17th European Conference on Computer Vision (ECCV), Oct. 2022.
    • tokenUni - Addressing Token Uniformity in Transformers via Singular Value Transformation
Addressing Token Uniformity in Transformers via Singular Value Transformation, 38th Conference on Uncertainty in Artificial Intelligence (UAI), Aug. 2022.
    • PANACEA dataset - Heterogeneous COVID-19 Claims
Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims, 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Jul. 2022.
Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media, 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Jul. 2022.
    • hyper-event-TempRel - Poincaré Event Temporal Embeddings and Hyperbolic GRU for Event TempRel Extraction
Extracting Event Temporal Relations via Hyperbolic Geometry, Conference on Empirical Methods in Natural Language Processing (EMNLP), Nov. 2021.
    • Position Bias Mitigation - A Knowledge-Aware Graph Model for Emotion Cause Extraction
Position Bias Mitigation: A Knowledge-Aware Graph Model for Emotion Cause Extraction, The 59th Annual Meeting of the Association for Computational Linguistics (ACL), Aug. 2021.
    • topical_wordvec_models - A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings
A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings, Transactions of the Association for Computational Linguistics, accepted.

People

Oana Cocarascu

Senior Lecturer in Artificial Intelligence

PhD Student

Bogdan Grecu

PhD student

Lin Gui

Lecturer in Natural Language Processing

Yulan He

Professor in Natural Language Processing

PhD Student

Projects

An AI-generated image of a network
Event-Centric Framework for Natural Language Understanding

The five-year UKRI-funded Turing AI Fellowship awarded to Yulan He aims to develop a machine reading comprehension model in which a computer could continuously build and update a graph of eventualities as reading progresses.

An AI-generated image of a woman
New Language Modelling

Lin Gui and Yulan He have been awarded a prestigious EPSRC New Horizons grant for a high-risk research project with potentially transformative impact. The project aims to develop a new language modelling method allowing for a more faithful and explainable approximation for the input text.

An AI-generated image of a hand holding a pencil above a notepad
Automated Scoring System for GCSE Science Exams

Funded by AQA, the project aims to develop an automated scoring system for assessing students’ answers to descriptive questions in GCSE Biology or Chemistry. The system is expected to produce prediction of marks and generate the rationales explaining the model decisions.

An AI-generated image of two robots approaching a woman reading a book
Character-Centric Narrative Understanding

The EPSRC ICASE project, jointly funded by Huawei London Research Centre, aims to develop new AI algorithms for automatic understanding of narratives in novels.

An AI-generated image of a human brain
Model Interpretability

In our EPSRC-funded project, “Twenty20Insight”, we aim to investigate explainable AI (XAI) approaches which can provide interpretations both faithful to model decisions and are also better understood by humans.

An AI-generated image of speech bubbles
PANACEA: PANdemic Ai Claim vEracity Assessment

Led by Yulan He, the EPSRC-funded PANACEA project developed novel supervised/unsupervised methods for veracity assessment of claims unverified at the time of posting, by integrating information from multiple sources and building a knowledge network that enables cross verification.

Activities

An AI-generated image of Earth floating in a bottle
Tutorial on Uncertainty Quantification for Text Classification in SIGIR 2023

Yulan He and Lin Gui from King's College London, together with Dell Zhang, Murat Sensoy, Masoud Makrehchi, Bilyana Taneva-Popova at Thomson Reuters, will deliver a tutorial on Uncertainty Quantification for Text Classification in SIGIR 2023 in July 2023.

An AI-generated image of people gathered in a circle
Invited Talk on Uncertainty Interpretation and Calibration of Language Models

Yulan He is invited to give a talk on uncertainty interpretation and calibration of language models in NLDB 2023 in June 2023.

An AI-generated image of a car and cityscape
Invited Talk on NLP Research to Drive FinTech

Yulan He gave an invited talk on “NLP Research to Drive FinTech: Now and Next” in the Gillmore Centre for Financial Technology at Warwick Business School in December 2022.

Events

08Nov

Long Narrative Understanding in the Era of Large Language Models

Prof Yulan He will present her group's latest research on Large Language Models

Please note: this event has passed.

Resources

Code & Dataset

    • NapSS - Paragraph-level Medical Text Simplification
NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization, Findings of EACL, May. 2023.
    • Bayesian-Trans - Extracting Event Temporal Relations
Event Temporal Relation Extraction with Bayesian Translational Model, 2023
    • dynamic Brand-Topic Model (dBTM) - models the evolution of the latent brand polarity scores and the topic-word distributions over time.
Tracking Brand-Associated Polarity-Bearing Topics in User Reviews, Transactions of the Association for Computational Linguistics, accepted.
    • PHEE - A Dataset for Pharmacovigilance Event Extraction from Text
PHEE: A Dataset for Pharmacovigilance Event Extraction from Text, The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), Dec. 2022.
    • TranCLR - Event-Centric Question Answering
Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation, Findings of EMNLP, Dec. 2022.
    • HINT - Hierarchical Interpretation of Neural Text Classification
Hierarchical Interpretation of Neural Text Classification, Computational Linguistics, to appear.
    • XProNet - Cross-modal Prototype Driven Network for Radiology Report Generation
Cross-modal Prototype Driven Network for Radiology Report Generation, 17th European Conference on Computer Vision (ECCV), Oct. 2022.
    • tokenUni - Addressing Token Uniformity in Transformers via Singular Value Transformation
Addressing Token Uniformity in Transformers via Singular Value Transformation, 38th Conference on Uncertainty in Artificial Intelligence (UAI), Aug. 2022.
    • PANACEA dataset - Heterogeneous COVID-19 Claims
Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims, 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Jul. 2022.
Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media, 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Jul. 2022.
    • hyper-event-TempRel - Poincaré Event Temporal Embeddings and Hyperbolic GRU for Event TempRel Extraction
Extracting Event Temporal Relations via Hyperbolic Geometry, Conference on Empirical Methods in Natural Language Processing (EMNLP), Nov. 2021.
    • Position Bias Mitigation - A Knowledge-Aware Graph Model for Emotion Cause Extraction
Position Bias Mitigation: A Knowledge-Aware Graph Model for Emotion Cause Extraction, The 59th Annual Meeting of the Association for Computational Linguistics (ACL), Aug. 2021.
    • topical_wordvec_models - A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings
A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings, Transactions of the Association for Computational Linguistics, accepted.