Publications

This section lists the reports and papers I produced during my internships.

Tracing Multilingual Representations in LLMs with Cross-Layer Transcoders
Abir Harrasse^*, Florent Draye^*, Zhijing Jin, Bernhard Schölkopf
Under Review
[arXiv]

TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research
Abir Harrasse^*, Philip Quirke^*, Clement Neo^*, Dhruv Nathawani, Luke Marks, Amir Abdullah
Accepted to EMNLP 2025 (Main Conference)
[arXiv] [Code]

Disentangling and Steering Multilingual Representations: Layer-Wise Analysis and Cross-Lingual Control in Language Models
Abir Harrasse^*, Florent Draye^*, Bernhard Schölkopf, Zhijing Jin
Accepted to ICML-AIW Workshop
[AIW Workshop]

Activation Space Interventions Can Be Transferred Between Large Language Models
Narmeen Oozeer^*, Dhruv Nathawani^*, Nirmalendu Prakash, Michael Lan, Abir Harrasse, Amirali Abdullah
Accepted to ICML 2025
[arXiv] [Code]