Publications

The publications listed in this section are reports and papers produced during my internships.

Disentangling and Steering Multilingual Representations: Layer-Wise Analysis and Cross-Lingual Control in Language Models
Abir Harrasse*, Florent Draye*, Bernhard Schölkopf, Zhijing Jin
Research Preprint (ICML-AIW Workshop)
[AIW Workshop]

TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research
Abir Harrasse*, Philip Quirke*, Clement Neo*, Dhruv Nathawani, Luke Marks, Amir Abdullah
Research Preprint (under review)
[arXiv] [Code]

Activation Space Interventions Can Be Transferred Between Large Language Models
Narmeen Oozeer*, Dhruv Nathawani*, Nirmalendu Prakash, Michael Lan, Abir Harrasse, Amirali Abdullah
Research Preprint (ICML 2025)
[arXiv] [Code]