Publications

Publications by categories in reversed chronological order.

2026

2026

  1. Responsible Evaluation of AI for Mental Health
    Hiba Arnaout, Anmol Goel, H Andrew Schwartz, and 8 more authors
    arXiv preprint arXiv:2602.00065, 2026

2025

2025

  1. Do LLMs Suppress Naïve Theories? Investigating Scientific Reasoning and Development in GPT-4o
    Sneh Gupta, Raj Sanjay Shah, and Sashank Varma
    Advances in Cognitive Systems, 2025
  2. The World According to LLMs: How Geographic Origin Influences LLMs’ Entity Deduction Capabilities
    Harsh Nishant Lalai, Raj Sanjay Shah, Jiaxin Pei, and 3 more authors
    In Second Conference on Language Modeling, 2025
  3. Guiding a user to interact with an intelligent computing system using best practices
    Michelle Brachman, Zahra Ashktorab, Michael Desmond, and 5 more authors
    Jun 2025
    US Patent App. 18/542,554
  4. Can llm-simulated practice and feedback upskill human counselors? a randomized study with 90+ novice counselors
    Ryan Louie, Raj Sanjay Shah, Ifdita Hasan Orney, and 3 more authors
    2025
  5. Helping the helper: Supporting peer counselors via ai-empowered practice and feedback
    Shang-Ling Hsu, Raj Sanjay Shah, Prathik Senthil, and 4 more authors
    Proceedings of the ACM on Human-Computer Interaction, 2025
  6. From Intentions to Techniques: A Comprehensive Taxonomy and Challenges in Text Watermarking for Large Language Models
    Harsh Nishant Lalai, Aashish Anantha Ramakrishnan, Raj Sanjay Shah, and 1 more author
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  7. TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes
    Raj Sanjay Shah, Lei Xu, Qianchu Liu, and 3 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), Jul 2025
  8. Findings of the Third BabyLM Challenge: Accelerating Language Modeling Research with Cognitively Plausible Data
    Lucas Charpentier, Leshem Choshen, Ryan Cotterell, and 8 more authors
    2025
  9. The potential–and the pitfalls–of using pre-trained language models as cognitive science theories
    Raj Sanjay Shah and Sashank Varma
    2025
  10. The unlearning mirage: A dynamic framework for evaluating LLM unlearning
    Raj Sanjay Shah, Jing Huang, Keerthiram Murugesan, and 2 more authors
    In Second Conference on Language Modeling, 2025

2024

2024

  1. Multi-Level Feedback Generation with Large Language Models for Empowering Novice Peer Counselors
    Alicja Chaszczewicz, Raj Sanjay Shah, Ryan Louie, and 3 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024
  2. Natural Mitigation of Catastrophic Interference: Continual Learning in Power-Law Learning Environments
    Raj Sanjay Shah, Atith Gandhi, Vijay Marupudi, and 1 more author
    2024
  3. What Makes Digital Support Effective? How Therapeutic Skills Affect Clinical Well-Being
    Wenjie Yang, Anna Fang, Raj Sanjay Shah, and 4 more authors
    Proceedings of the ACM on Human-Computer Interaction, 2024
  4. LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
    Jiangshu Du, Yibo Wang, Wenting Zhao, and 8 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  5. How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect
    Siddhartha K. Vemuri, Raj Sanjay Shah, and Sashank Varma
    Proceedings of the 46th Annual Meeting of the Cognitive Science Society, 2024
  6. Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention
    Andrew Li, Xianle Feng, Siddhant Narang, and 4 more authors
    Proceedings of the Annual Meeting of the Cognitive Science Society, 2024
  7. shah2024.png
    Development of Cognitive Intelligence in Pre-trained Language Models
    Raj Sanjay Shah, Khushi Bhardwaj, and Sashank Varma
    Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  8. Understanding Graphical Perception in Data Visualization through Zero-Shot Prompting of Vision-Language Models
    Grace Guo*, Jenna Jiayi Kang*, Raj Sanjay Shah*, and 2 more authors
    NeurIPS 2024 Workshop on Behavioral Machine Learning, 2024

2023

2023

  1. Pre-training LLMs Using a Human-Like Development Data Corpus
    Khushi Bhardwaj, Raj Sanjay Shah, and Sashank Varma
    Proceedings of the BabyLM Challenge at the 27th Conference on Computational Linguistics, 2023
  2. Numeric Magnitude Comparison Effects in Large Language Models
    Raj Sanjay Shah, Vijay Marupudi, Reba Koenen, and 2 more authors
    In Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

2022

  1. Modeling Motivational Interviewing Strategies on an Online Peer-to-Peer Counseling Platform
    Raj Sanjay Shah, Faye Holt, Shirley Anugrah Hayati, and 4 more authors
    Proceedings of the ACM on Human-Computer Interaction, 2022
  2. When FLUE Meets FLANG: Benchmarks and Large Pretrained Language Models for the Financial Domain
    Raj Sanjay Shah, Kunal Chawla, Dheeraj Eidnani, and 7 more authors
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
  3. JARVix at SemEval-2022 Task 2: It Takes One to Know One? Idiomaticity Detection Using Zero and One-Shot Learning
    Ashwin Pathak, Raj Sanjay Shah, Vaibhav Kumar, and 1 more author
    Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval), 2022

2021

2021

  1. Bitcoin Data Analytics: Scalable Techniques for Transaction Clustering and Embedding Generation
    Raj Sanjay Shah, Ashutosh Bhatia, Atith Gandhi, and 1 more author
    In 2021 International Conference on Communication Systems & Networks (COMSNETS), 2021

2020

2020

  1. CTI-Twitter: Gathering Cyber Threat Intelligence from Twitter Using Integrated Supervised and Unsupervised Learning
    Linn-Mari Kristiansen, Vinti Agarwal, Katrin Franke, and 1 more author
    In 2020 IEEE International Conference on Big Data (BigData), 2020