Hello, I am Somin!
I am a CS Ph.D. candidate in Khoury College of Computer Sciences at Northeastern University, advised by Byron Wallace and Silvio Amir.
I study how to extract and communicate insights from large text corpora by combining natural language processing and structured information extraction. My recent projects span clinical literature, and distillation techniques for large language models.
Selected publications (full list)
- Who Taught You That? Tracing Teachers in Model Distillation. Somin Wadhwa, Chantal Shaib, Silvio Amir and Byron C. Wallace. Association for Computational Linguistics (ACL). Vienna, Austria. July 2025.
- Investigating Mysteries of CoT-Augmented Distillation. Somin Wadhwa, Silvio Amir and Byron C. Wallace. Empirical Methods in Natural Language Processing (EMNLP). Miami, Florida, USA. November 2024.
- Learning from Natural Language Explanations for Generalizable Entity Matching. Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C. Wallace and Chris Kong. Empirical Methods in Natural Language Processing (EMNLP). Miami, Florida, USA. November 2024.
- Distilling Event Sequence Knowledge From Large Language Models. Somin Wadhwa, Oktie Hassanzadeh, Debarun Bhattacharjya, Ken Barker and Jian Ni. International Semantic Web Conference (ISWC). Baltimore, Maryland, USA. November 2024.
- Revisiting Relation Extraction in the era of Large Language Models. Somin Wadhwa, Silvio Amir and Byron C. Wallace. Association for Computational Linguistics (ACL). Toronto, Canada. July 2023.
trailside musings...
I enjoy hiking the local mountains, running when I can, casting a line whenever the fish are biting, and spending time with my dog.
Always happy to swap trail stories, race routes or dog photos over coffee