Arkil Patel (original) (raw)

I am a PhD student in Computer Science at Mila and McGill where I am supervised by Prof. Dzmitry Bahdanau and Prof. Siva Reddy. Previously, I spent 2.5 amazing years as a Research Fellow at Microsoft Research India, where I worked with Dr. Navin Goyal. I also interned with the AllenNLP team at the Allen Institute for Artificial Intelligence (AI2). At AI2, I worked with Pradeep Dasigi on evaluating code generation in LLMs.

I do research in Machine Learning on various interesting aspects surrounding Large Language Models (LLMs). My work focuses on understanding and evaluating the abilities and limitations of LLMs with respect to their generalization, scaling, or reasoning behaviours. My hope is that the knowledge derived from my analysis works will help design more robust systems capable of better out-of-distribution (OOD) generalization, and eventually of exhibiting superintelligence.

Keywords: generalization, reasoning, scaling, evaluation, safety, analysis and interpretability

I graduated with B.E. (Hons.) in Computer Science from BITS Pilani - Goa Campus, India in 2020. For more details about my background, refer to my CV. If you'd like to chat with me about my work or research in general, feel free to reach out!

Apr 01, 2025

Our Thoughtology paper investigating the reasoning chains-of-thoughts of the Large Reasoning Model DeepSeek-R1 is out!

Mar 30, 2025

Our paper on AI safety investigating the transferability of adversarial triggers in LLMs has been accepted to TACL!

Google Scholar| Semantic Scholar

Reviewer COLM 2024 ACL Rolling Review ACL 2023 EMNLP 2021, 2022, 2023 NAACL 2021 AAAI 2022

BITS Pilani

2016 - 2020

Microsoft Research India

2019 - 2022

Allen Institute for AI

Summer 2023

Mila - Quebec AI Institute

2022 - Present

McGill University

2022 - Present

Template: Sebastin