Home

Reading Group

The paper reading group meets weekly during the semester to discuss papers. Participation is open to all, guests are always welcome; if you are interested in receiving invitations contact the organizer.

Each week we will discuss a different paper. The paper to discuss is announced about one week in advance by the organizer. All participants are expected to read the paper before the meeting. It is recommended to take notes about insights, questions, and other points potentially worth discussing.

The goals of the reading group are:

Critical reflection on scientific work
Practice of reading and argumentation strategies
Exposure to a broad range of research topics
Practice of leading group discussions

The discussion is limited to one hour. The discussion is led by a moderator, who may also set a focus for the discussion. The moderator will kick off the meeting by giving a short summary of the paper and raising a few points for discussion. The moderator should try to incorporate all participants into the discussion. The moderator role rotates through all participants. The moderator is encouraged to help with the selection of a paper that week.

Time and location: Monday 11am-12pm at TCS 360 (remote participation is possible, zoom link on request)

Coordinator: Nadia Nahar (nadian at andrew dot cmu dot edu)

Subscribe for announcements on the [email protected] mailing list here: https://lists.andrew.cmu.edu/mailman/listinfo/feature-prg

Agenda

The archive of discussed papers can be found here.

December 16, 2025

Tian, Zhao, et al. Aligning Requirement for Large Language Model’s Code Generation. arXiv preprint arXiv:2509.01313 (2025). Moderator: Nadia

December 9, 2025

Sarkar, Suproteem K., et al. AI Agents, Productivity, and Higher-Order Thinking: Early Evidence From Software Development. SSRN working paper (2025). Moderator: Hao

December 2, 2025

White, Jules, et al. ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design. Generative AI for Effective Software Development. Springer Nature Switzerland (2024), pp. 71–108. Moderator: Courtney

November 18, 2025

Sporsem, T., et al. Clash of Requirements: Users First vs. Model First. Proceedings of the ACM Symposium on Foundations of Software Engineering (FSE) (2025). Moderator: Christian

November 11, 2025

Wang, Zora Zhiruo, et al. How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations. arXiv preprint arXiv:2510.22780 (2025). Moderator: Chenyang

October 28, 2025

Wang, Haoyu, et al. AgentSpec: Customizable Runtime Enforcement for Safe and Reliable LLM Agents. arXiv preprint arXiv:2503.18666 (2025). Moderator: Yining

October 7, 2025

Choudhuri, Rudrajit, et al. What Guides Our Choices? Modeling Developers’ Trust and Behavioral Intentions Towards GenAI. Proceedings of the 47th IEEE/ACM International Conference on Software Engineering (ICSE) (2025). Moderator: Nadia

September 23, 2025

Xiao, Tao, et al. Self-admitted GenAI Usage in Open-Source Software. arXiv preprint arXiv:2507.10422 (2025). Moderator: Hao

September 16, 2025

Feng, et al. Charting Uncertain Waters: A Socio-Technical Framework for Navigating GenAI’s Impact on Open Source Communities. arXiv preprint arXiv:2508.04921 (2025). Moderator: Courtney

September 9, 2025

Buyl, Maarten, et al. AI Alignment at Your Discretion. Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency (2025). Moderator: Chenyang

September 2, 2025

Becker, Joel, Nate Rush, Elizabeth Barnes, and David Rein. Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity. arXiv preprint arXiv:2507.09089 (2025). Moderator: Christian

July 25, 2025

Wang, Jianwei, et al. LLM-based HSE Compliance Assessment: Benchmark, Performance, and Advancements. arXiv preprint arXiv:2505.22959 (2025). Moderator: Erica

July 18, 2025

Ahmed, Toufique, et al. Can LLMs replace manual annotation of software engineering artifacts?. 2025 IEEE/ACM 22nd International Conference on Mining Software Repositories (MSR). IEEE, 2025. Moderator: Jofred

July 11, 2025

Ginart, Tony, Martin Jinye Zhang, and James Zou. Mldemon: Deployment monitoring for machine learning systems. International Conference on Artificial Intelligence and Statistics. PMLR, 2022. Moderator: Jason

June 27, 2025

Costa, Manuel, et al. Securing AI Agents with Information-Flow Control arXiv preprint arXiv:2505.23643 (2025). Moderator: Aarya

June 20, 2025

Pedro, Rodrigo, et al. From prompt injections to sql injection attacks: How protected is your llm-integrated web application? arXiv preprint arXiv:2308.01990 (2023). Moderator: Abhi

June 13, 2025

Protschky, Dominik, et al. What Gets Measured Gets Improved: Monitoring Machine Learning Applications in their Production Environments. IEEE Access (2025). Moderator: Nadia

June 6, 2025

Shankar, Shreya, et al. We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning. Proceedings of the ACM on Human-Computer Interaction 8.CSCW1 (2024): 1-34. Moderator: Yining

May 16, 2025

Shao, Yuchen, et al. Are LLMs Correctly Integrated into Software Systems? 2025 IEEE/ACM 47th International Conference on Software Engineering (ICSE). IEEE Computer Society, 2025. Moderator: Alex

May 9, 2025

Ayoola, Bimpe, et al. User Personas Improve Social Sustainability by Encouraging Software Developers to Deprioritize Antisocial Features. arXiv preprint arXiv:2412.10672 (2024). Moderator: Christian

April 25, 2025

Wang, Chenyu, et al. Quality assurance for artificial intelligence: A study of industrial concerns, challenges and best practices. arXiv preprint arXiv:2402.16391 (2024). Moderator: Nadia

April 18, 2025

South, Tobin, et al. Authenticated Delegation and Authorized AI Agents. arXiv preprint arXiv:2501.09674 (2025). Moderator: Yining

Mar 28, 2025

Németh, Brigitta, and Johannes Wachs. Anchor Sponsor Firms in Open Source Software Ecosystems. arXiv preprint arXiv:2502.09060 (2025). Moderator: Hao

Mar 21, 2025

Perry, Neil, et al. Do users write more insecure code with ai assistants?. arXiv preprint arXiv:2211.03622 (2022). Moderator: Courtney

Mar 14, 2025

Hossain, Soneya Binta, and Matthew Dwyer. Togll: Correct and strong test oracle generation with llms. arXiv preprint arXiv:2405.03786 (2024). Moderator: Alex

Feb 21, 2025

Rouge, Phoebe, et al. Checkout checkup: Misuse of payment data from web skimming. 2020. Moderator: Hao

Feb 14, 2025

Titzer, Ben L., et al. Flexible Non-intrusive Dynamic Instrumentation for WebAssembly. Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3. 2024. Moderator: Chenyang

Feb 7, 2025

Ding, Yangruibo, et al. Vulnerability detection with code language models: How far are we? arXiv preprint arXiv:2403.18624 (2024). Moderator: Christian

Jan 31, 2025

Parnin, Chris, et al. Building Your Own Product Copilot: Challenges, Opportunities, and Needs. arXiv preprint arXiv:2312.14231 (2023). Moderator: Nadia

Jan 17, 2025

Rismani, Shalaleh, et al. From plane crashes to algorithmic harm: applicability of safety engineering frameworks for responsible ML. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 2023. Moderator: Yining

Dec 9, 2024

Paradis, Elise, et al. How much does AI impact development speed? An enterprise-based randomized controlled trial. arXiv preprint arXiv:2410.12944 (2024). Moderator: Courtney

Dec 2, 2024

Devanbu, Prem, Thomas Zimmermann, and Christian Bird. Belief & evidence in empirical software engineering. Proceedings of the 38th international conference on software engineering. 2016. Moderator: Hao

Nov 25, 2024

Shankar, Shreya, Aditya G. Parameswaran, and Eugene Wu. DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing. arXiv preprint arXiv:2410.12189 (2024). Moderator: Chenyang

Nov 11, 2024

Feng, Nick, et al. Normative Requirements Operationalization with Large Language Models. 2024 IEEE 32nd International Requirements Engineering Conference (RE). IEEE, 2024. Moderator: Christian

Nov 4, 2024

Liang, Jenny T., et al. Prompts are programs too! Understanding how developers build software containing prompts. arXiv preprint arXiv:2409.12447 (2024). Moderator: Nadia

Oct 28, 2024

Rismani, Shalaleh, et al. Beyond the ML Model: Applying Safety Engineering Frameworks to Text-to-Image Development. Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society. 2023. Moderator: Yining

Oct 21, 2024

Mirhosseini, Samim, and Chris Parnin. Can automated pull requests encourage software developers to upgrade out-of-date dependencies?. 2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 2017. Moderator: Courtney

Oct 7, 2024

Moqri, Mahdi, et al. Effect of “following” on contributions to open source communities. Journal of Management Information Systems 35.4 (2018): 1188-1217. Moderator: Hao

Sep 16, 2024

Shankar, Shreya, et al. Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences. arXiv preprint arXiv:2404.12272 (2024). Moderator: Chenyang

Sep 9, 2024

Slovic, Paul. Perception of risk. Science 236, no. 4799 (1987): 280-285. Moderator: Christian

July 23, 2024

Beutel, Alex, et al. Copycatch: stopping group attacks by spotting lockstep behavior in social networks. Proceedings of the 22nd International Conference on World Wide Web. 2013. Moderator: Felix

July 17, 2024

Jacovi, Alon, et al. Formalizing trust in artificial intelligence: Prerequisites, causes and goals of human trust in AI. Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. 2021. Moderator: Aiden

July 10, 2024

Lipton, Zachary C. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16.3 (2018): 31-57. Moderator: Jacob

July 3, 2024

Ronanki, K., Berger, C., & Horkoff, J. Investigating ChatGPT’s potential to assist in requirements elicitation processes. In 2023 49th Euromicro Conference on Software Engineering and Advanced Applications (SEAA) (pp. 354-361). IEEE. Moderator: Maria

June 26, 2024

Schueller, William, and Johannes Wachs. Modeling interconnected social and technical risks in open source software ecosystems. Collective Intelligence 3.1 (2024): 26339137241231912. Moderator: Noah

June 18, 2024

Kohno, Tadayoshi, Yasemin Acar, and Wulf Loh. Ethical frameworks and computer security trolley problems: Foundations for conversations. 32nd USENIX Security Symposium (USENIX Security 23). pp. 5145-5162. 2023. Moderator: Joshua

June 12, 2024

Bhatt, Umang, Alice Xiang, Shubham Sharma, Adrian Weller, Ankur Taly, Yunhan Jia, Joydeep Ghosh, Ruchir Puri, José MF Moura, and Peter Eckersley. Explainable machine learning in deployment. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, pp. 648-657. 2020. Moderator: Nadia

June 5, 2024

Elizabeth Lin, Igibek Koishybayev, Trevor Dunlap, William Enck, and Alexandros Kapravelos, UntrustIDE: Exploiting Weaknesses in VS Code Extensions, in Proceedings of the ISOC Network and Distributed Systems Symposium (NDSS), Feb. 2024. Moderator: Hao

May 29, 2024

Guizani, Mariam, Aileen Abril Castro-Guzman, Anita Sarma, and Igor Steinmacher. Rules of Engagement: Why and How Companies Participate in OSS. In 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE), pp. 2617-2629. IEEE, 2023. Moderator: Courtney

May 13, 2024

Du, Kun, et al. Understanding promotion-as-a-service on GitHub. Proceedings of the 36th Annual Computer Security Applications Conference. 2020. Moderator: Hao

April 29, 2024

Zhang, Yuxia, et al. Corporate dominance in open source ecosystems: a case study of OpenStack. Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2022. Moderator: Courtney

April 22, 2024

Ray, Hirak, et al. Why Older Adults (Don't) Use Password Managers. 30th USENIX Security Symposium (USENIX Security 21). 2021. Moderator: Lirong

April 8, 2024

Kim, Tae Soo, et al. Evallm: Interactive evaluation of large language model prompts on user-defined criteria. arXiv preprint arXiv:2309.13633 (2023). Moderator: Chenyang

March 25, 2024

Wang, Zijie J., et al. Farsight: Fostering Responsible AI Awareness During AI Application Prototyping. arXiv preprint arXiv:2402.15350 (2024). Moderator: Nadia

March 18, 2024

Walden, James. The impact of a major security event on an open source project: The case of OpenSSL. Proceedings of the 17th international conference on mining software repositories. 2020. Moderator: Hao

March 11, 2024

Feffer, Michael, et al. Red-Teaming for Generative AI: Silver Bullet or Security Theater?. arXiv preprint arXiv:2401.15897 (2024). Moderator: Christian

Feb 26, 2024

Qiu, Huilian Sophie, et al. The signals that potential contributors look for when choosing open-source projects. Proceedings of the ACM on Human-Computer Interaction3.CSCW (2019): 1-29. Moderator: Courtney

Feb 19, 2024

Mazurek, Michelle L., et al. Access control for home data sharing: Attitudes, needs and practices. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 2010. Moderator: Lirong

Feb 12, 2024

Buçinca, Zana, et al. AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms. arXiv preprint arXiv:2306.03280 (2023). Moderator: Nadia

Feb 5, 2024

Khattab, Omar, et al. Dspy: Compiling declarative language model calls into self-improving pipelines. arXiv preprint arXiv:2310.03714 (2023). Moderator: Chenyang

Jan 29, 2024

Unpublished paper draft. Moderator: Christian

Jan 22, 2024

Liu, Chengwei, et al. Demystifying the vulnerability propagation and its evolution via dependency trees in the npm ecosystem. Proceedings of the 44th International Conference on Software Engineering. 2022. Moderator: Hao