Addressing if, and how, machine learning and AI technologies can be used towards fair and equitable futures

Faculty Research

12 Nov

Written By CTMF Admin

Zeerak Talat

Chancellor’s Fellow in Responsible Machine Learning and Artificial Intelligence

View bio

Research Areas of Expertise:

Ethical and responsible natural language processing, machine learning, and artificial intelligence for socially contingent data.

Research Summary:

Zeerak’s research is motivated by the realisation that if we must have machine learning systems in our society, then we bear a responsible to identify desirable traits for such systems. Their work seeks to do this by viewing machine learning technologies through the lens of content moderation, as concerns and harms arising from technologies very quickly become apparent when the technologies might come to govern our speech. For this reason, Zeerak’s work understanding and critiquing machine learning systems borrows from multiple fields, including natural language processing, science and technology studies, and media studies. Zeerak’s work has been published in conferences such as the Annual Meetings of the Association of Computational Linguistics (ACL), the ACM Conference on Fairness, Accountability, and Transparency (FAccT), and in collected volumes on online harms and artificial intelligence.

Key Publications:

‘Exploring the Limitations of Detecting Machine-Generated Text.’ Jad Doughman, Osama Mohammed Afzal, Hawau Olamide Toyin, Shady Shehata, Preslav Nakov, Zeerak Talat. Proceedings of the 31st International Conference on Computational Linguistics, pages 4274–4281 January 19–24, 2025. 🔗
‘The Only Way is Ethics: A Guide to Ethical Research with Large Language Models.’ Eddie L. Ungless, Nikolas Vitsakis, Zeerak Talat, James Garforth, Björn Ross, Arno Onken, Atoosa Kasirzadeh, Alexandra Birch. Proceedings of the 31st International Conference on Computational Linguistics, pages 8992–9005 January 19–24, 2025. 🔗
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models. Eddie L. Ungless, Nikolas Vitsakis, Zeerak Talat, James Garforth, Björn Ross, Arno Onken, Atoosa Kasirzadeh, Alexandra Birch. 2024. 🔗
‘Understanding “Democratization” in NLP and ML Research.’ Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, and Zeerak Talat. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 3151–3166, Miami, Florida, USA. Association for Computational Linguistics. 2024. 🔗
‘Metrics for What, Metrics for Whom: Assessing Actionability of Bias Evaluation Metrics in NLP.’ Pieter Delobelle, Giuseppe Attanasio, Debora Nozza, Su Lin Blodgett, and Zeerak Talat. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 21669–21691, Miami, Florida, USA. Association for Computational Linguistics. 2024. 🔗
‘Impoverished Language Technology: The Lack of (Social) Class in NLP’. Cercas Curry, Amanda, Zeerak Talat, and Dirk Hovy. 2024. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), eds. Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue. Torino, Italia: ELRA and ICCL, 8675–82. 🔗
‘The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels’. Fleisig, Eve, Su Lin Blodgett, Dan Klein, and Zeerak Talat. 2024. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Mexico City, Mexico: Association for Computational Linguistics, 2279–92. 🔗
‘It’s Incomprehensible: On Machine Learning and Decoloniality’. Birhane, Abeba, and Zeerak Talat. 2023. In Handbook of Critical Studies of Artificial Intelligence, ed. Simon Lindgren. Edward Elgar Publishing, 128–40. 🔗
‘Mirages. On Anthropomorphism in Dialogue Systems’. Abercrombie, Gavin, Amanda Curry, Tanvi Dinkar, Verena Rieser, and Zeerak Talat. 2023. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore: Association for Computational Linguistics, 4776–90. 🔗
‘A Federated Approach for Hate Speech Detection’. Gala, Jay, Deep Gandhi, Jash Mehta, and Zeerak Talat. 2023. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, Dubrovnik, Croatia: Association for Computational Linguistics, 3248–59. 🔗
‘Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing’. Kaffee, Lucie-Aimée, Arnav Arora, Zeerak Talat, and Isabelle Augenstein. 2023. In Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore: Association for Computational Linguistics, 13977–98. 🔗
‘Futures for Research on Hate Speech in Online Social Media Platforms’. Kirtz, Jaimie Lee, and Zeerak Talat. 2023. eds. Christian Strippel, Sünje Paasch-Colberg, Martin Emmer, and Joachim Trebbe. 🔗
‘Directions for NLP Practices Applied to Online Hate Speech Detection’. Fortuna, Paula, Monica Dominguez, Leo Wanner, and Zeerak Talat. 2022. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Abu Dhabi, United Arab Emirates: Association for Computational Linguistics, 11794–805. 🔗
‘Data Governance in the Age of Large-Scale Data-Driven Language Technology’. Jernite, Yacine, Huu Nguyen, Stella Biderman, Anna Rogers, Maraim Masoud, Valentin Danchev, Samson Tan, et al. 2022. In 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul Republic of Korea: ACM, 2206–22. 🔗
‘On the Machine Learning of Ethical Judgments from Natural Language’. Talat, Zeerak, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, and Adina Williams. 2022. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Seattle, United States: Association for Computational Linguistics, 769–79. 🔗
‘You Reap What You Sow: On the Challenges of Bias Evaluation under Multilingual Settings’. Talat, Zeerak, Aurélie Névéol, Stella Biderman, Miruna Clinciu, Manan Dey, Shayne Longpre, Sasha Luccioni, et al. 2022. In Proceedings of BigScience Episode #5 – Workshop on Challenges & Perspectives in Creating Large Language Models, virtual+Dublin: Association for Computational Linguistics, 26–41. 🔗
‘A Survey of Race, Racism, and Anti-Racism in NLP’. Field, Anjalie, Su Lin Blodgett, Zeerak Talat, and Yulia Tsvetkov. 2021. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Online: Association for Computational Linguistics, 1905–25. 🔗
‘Understanding Abuse: A Typology of Abusive Language Detection Subtasks’. Talat, Zeerak, Thomas Davidson, Dana Warmsley, and Ingmar Weber. 2017. In Proceedings of the First Workshop on Abusive Language Online, Vancouver, BC, Canada: Association for Computational Linguistics, 78–84. 🔗
Talat, Zeerak, and Dirk Hovy. 2016. ‘Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter’. In Proceedings of the NAACL Student Research Workshop, San Diego, California: Association for Computational Linguistics, 88–93. 🔗