-
Notifications
You must be signed in to change notification settings - Fork 0
Home
The paper reading group meets weekly during the semester to discuss papers. Participation is open to all, guests are always welcome; if you are interested in receiving invitations contact the organizer.
Each week we will discuss a different paper. The paper to discuss is announced about one week in advance by the organizer. All participants are expected to read the paper before the meeting. It is recommended to take notes about insights, questions, and other points potentially worth discussing.
The goals of the reading group are:
- Critical reflection on scientific work
- Practice of reading and argumentation strategies
- Exposure to a broad range of research topics
- Practice of leading group discussions
The discussion is limited to one hour. The discussion is led by a moderator, who may also set a focus for the discussion. The moderator will kick off the meeting by giving a short summary of the paper and raising a few points for discussion. The moderator should try to incorporate all participants into the discussion. The moderator role rotates through all participants. The moderator is encouraged to help with the selection of a paper that week.
Time and location: Monday 11am-12pm at TCS 360 (remote participation is possible, zoom link on request)
Coordinator: Nadia Nahar (nadian at andrew dot cmu dot edu)
Subscribe for announcements on the [email protected] mailing list here: https://lists.andrew.cmu.edu/mailman/listinfo/feature-prg
The archive of discussed papers can be found here.
Tian, Zhao, et al. Aligning Requirement for Large Language Model’s Code Generation. arXiv preprint arXiv:2509.01313 (2025). Moderator: Nadia
Sarkar, Suproteem K., et al. AI Agents, Productivity, and Higher-Order Thinking: Early Evidence From Software Development. SSRN working paper (2025). Moderator: Hao
White, Jules, et al. ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design. Generative AI for Effective Software Development. Springer Nature Switzerland (2024), pp. 71–108. Moderator: Courtney
Sporsem, T., et al. Clash of Requirements: Users First vs. Model First. Proceedings of the ACM Symposium on Foundations of Software Engineering (FSE) (2025). Moderator: Christian
Wang, Zora Zhiruo, et al. How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations. arXiv preprint arXiv:2510.22780 (2025). Moderator: Chenyang
Wang, Haoyu, et al. AgentSpec: Customizable Runtime Enforcement for Safe and Reliable LLM Agents. arXiv preprint arXiv:2503.18666 (2025). Moderator: Yining
Choudhuri, Rudrajit, et al. What Guides Our Choices? Modeling Developers’ Trust and Behavioral Intentions Towards GenAI. Proceedings of the 47th IEEE/ACM International Conference on Software Engineering (ICSE) (2025). Moderator: Nadia
Xiao, Tao, et al. Self-admitted GenAI Usage in Open-Source Software. arXiv preprint arXiv:2507.10422 (2025). Moderator: Hao
Feng, et al. Charting Uncertain Waters: A Socio-Technical Framework for Navigating GenAI’s Impact on Open Source Communities. arXiv preprint arXiv:2508.04921 (2025). Moderator: Courtney
Buyl, Maarten, et al. AI Alignment at Your Discretion. Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency (2025). Moderator: Chenyang
Becker, Joel, Nate Rush, Elizabeth Barnes, and David Rein. Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity. arXiv preprint arXiv:2507.09089 (2025). Moderator: Christian
Wang, Jianwei, et al. LLM-based HSE Compliance Assessment: Benchmark, Performance, and Advancements. arXiv preprint arXiv:2505.22959 (2025). Moderator: Erica
Ahmed, Toufique, et al. Can LLMs replace manual annotation of software engineering artifacts?. 2025 IEEE/ACM 22nd International Conference on Mining Software Repositories (MSR). IEEE, 2025. Moderator: Jofred
Ginart, Tony, Martin Jinye Zhang, and James Zou. Mldemon: Deployment monitoring for machine learning systems. International Conference on Artificial Intelligence and Statistics. PMLR, 2022. Moderator: Jason
Costa, Manuel, et al. Securing AI Agents with Information-Flow Control arXiv preprint arXiv:2505.23643 (2025). Moderator: Aarya
Pedro, Rodrigo, et al. From prompt injections to sql injection attacks: How protected is your llm-integrated web application? arXiv preprint arXiv:2308.01990 (2023). Moderator: Abhi
Protschky, Dominik, et al. What Gets Measured Gets Improved: Monitoring Machine Learning Applications in their Production Environments. IEEE Access (2025). Moderator: Nadia
Shankar, Shreya, et al. We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning. Proceedings of the ACM on Human-Computer Interaction 8.CSCW1 (2024): 1-34. Moderator: Yining
Shao, Yuchen, et al. Are LLMs Correctly Integrated into Software Systems? 2025 IEEE/ACM 47th International Conference on Software Engineering (ICSE). IEEE Computer Society, 2025. Moderator: Alex
Ayoola, Bimpe, et al. User Personas Improve Social Sustainability by Encouraging Software Developers to Deprioritize Antisocial Features. arXiv preprint arXiv:2412.10672 (2024). Moderator: Christian
Wang, Chenyu, et al. Quality assurance for artificial intelligence: A study of industrial concerns, challenges and best practices. arXiv preprint arXiv:2402.16391 (2024). Moderator: Nadia
South, Tobin, et al. Authenticated Delegation and Authorized AI Agents. arXiv preprint arXiv:2501.09674 (2025). Moderator: Yining
Németh, Brigitta, and Johannes Wachs. Anchor Sponsor Firms in Open Source Software Ecosystems. arXiv preprint arXiv:2502.09060 (2025). Moderator: Hao
Perry, Neil, et al. Do users write more insecure code with ai assistants?. arXiv preprint arXiv:2211.03622 (2022). Moderator: Courtney
Hossain, Soneya Binta, and Matthew Dwyer. Togll: Correct and strong test oracle generation with llms. arXiv preprint arXiv:2405.03786 (2024). Moderator: Alex
Rouge, Phoebe, et al. Checkout checkup: Misuse of payment data from web skimming. 2020. Moderator: Hao
Titzer, Ben L., et al. Flexible Non-intrusive Dynamic Instrumentation for WebAssembly. Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3. 2024. Moderator: Chenyang
Ding, Yangruibo, et al. Vulnerability detection with code language models: How far are we? arXiv preprint arXiv:2403.18624 (2024). Moderator: Christian
Parnin, Chris, et al. Building Your Own Product Copilot: Challenges, Opportunities, and Needs. arXiv preprint arXiv:2312.14231 (2023). Moderator: Nadia
Rismani, Shalaleh, et al. From plane crashes to algorithmic harm: applicability of safety engineering frameworks for responsible ML. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 2023. Moderator: Yining
Paradis, Elise, et al. How much does AI impact development speed? An enterprise-based randomized controlled trial. arXiv preprint arXiv:2410.12944 (2024). Moderator: Courtney
Devanbu, Prem, Thomas Zimmermann, and Christian Bird. Belief & evidence in empirical software engineering. Proceedings of the 38th international conference on software engineering. 2016. Moderator: Hao
Shankar, Shreya, Aditya G. Parameswaran, and Eugene Wu. DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing. arXiv preprint arXiv:2410.12189 (2024). Moderator: Chenyang
Feng, Nick, et al. Normative Requirements Operationalization with Large Language Models. 2024 IEEE 32nd International Requirements Engineering Conference (RE). IEEE, 2024. Moderator: Christian
Liang, Jenny T., et al. Prompts are programs too! Understanding how developers build software containing prompts. arXiv preprint arXiv:2409.12447 (2024). Moderator: Nadia
Rismani, Shalaleh, et al. Beyond the ML Model: Applying Safety Engineering Frameworks to Text-to-Image Development. Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society. 2023. Moderator: Yining
Mirhosseini, Samim, and Chris Parnin. Can automated pull requests encourage software developers to upgrade out-of-date dependencies?. 2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 2017. Moderator: Courtney
Moqri, Mahdi, et al. Effect of “following” on contributions to open source communities. Journal of Management Information Systems 35.4 (2018): 1188-1217. Moderator: Hao
Shankar, Shreya, et al. Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences. arXiv preprint arXiv:2404.12272 (2024). Moderator: Chenyang
Slovic, Paul. Perception of risk. Science 236, no. 4799 (1987): 280-285. Moderator: Christian
Beutel, Alex, et al. Copycatch: stopping group attacks by spotting lockstep behavior in social networks. Proceedings of the 22nd International Conference on World Wide Web. 2013. Moderator: Felix
Jacovi, Alon, et al. Formalizing trust in artificial intelligence: Prerequisites, causes and goals of human trust in AI. Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. 2021. Moderator: Aiden
Lipton, Zachary C. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16.3 (2018): 31-57. Moderator: Jacob
Ronanki, K., Berger, C., & Horkoff, J. Investigating ChatGPT’s potential to assist in requirements elicitation processes. In 2023 49th Euromicro Conference on Software Engineering and Advanced Applications (SEAA) (pp. 354-361). IEEE. Moderator: Maria
Schueller, William, and Johannes Wachs. Modeling interconnected social and technical risks in open source software ecosystems. Collective Intelligence 3.1 (2024): 26339137241231912. Moderator: Noah
Kohno, Tadayoshi, Yasemin Acar, and Wulf Loh. Ethical frameworks and computer security trolley problems: Foundations for conversations. 32nd USENIX Security Symposium (USENIX Security 23). pp. 5145-5162. 2023. Moderator: Joshua
Bhatt, Umang, Alice Xiang, Shubham Sharma, Adrian Weller, Ankur Taly, Yunhan Jia, Joydeep Ghosh, Ruchir Puri, José MF Moura, and Peter Eckersley. Explainable machine learning in deployment. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, pp. 648-657. 2020. Moderator: Nadia
Elizabeth Lin, Igibek Koishybayev, Trevor Dunlap, William Enck, and Alexandros Kapravelos, UntrustIDE: Exploiting Weaknesses in VS Code Extensions, in Proceedings of the ISOC Network and Distributed Systems Symposium (NDSS), Feb. 2024. Moderator: Hao
Guizani, Mariam, Aileen Abril Castro-Guzman, Anita Sarma, and Igor Steinmacher. Rules of Engagement: Why and How Companies Participate in OSS. In 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE), pp. 2617-2629. IEEE, 2023. Moderator: Courtney
Du, Kun, et al. Understanding promotion-as-a-service on GitHub. Proceedings of the 36th Annual Computer Security Applications Conference. 2020. Moderator: Hao
Zhang, Yuxia, et al. Corporate dominance in open source ecosystems: a case study of OpenStack. Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2022. Moderator: Courtney
Ray, Hirak, et al. Why Older Adults (Don't) Use Password Managers. 30th USENIX Security Symposium (USENIX Security 21). 2021. Moderator: Lirong
Kim, Tae Soo, et al. Evallm: Interactive evaluation of large language model prompts on user-defined criteria. arXiv preprint arXiv:2309.13633 (2023). Moderator: Chenyang
Wang, Zijie J., et al. Farsight: Fostering Responsible AI Awareness During AI Application Prototyping. arXiv preprint arXiv:2402.15350 (2024). Moderator: Nadia
Walden, James. The impact of a major security event on an open source project: The case of OpenSSL. Proceedings of the 17th international conference on mining software repositories. 2020. Moderator: Hao
Feffer, Michael, et al. Red-Teaming for Generative AI: Silver Bullet or Security Theater?. arXiv preprint arXiv:2401.15897 (2024). Moderator: Christian
Qiu, Huilian Sophie, et al. The signals that potential contributors look for when choosing open-source projects. Proceedings of the ACM on Human-Computer Interaction3.CSCW (2019): 1-29. Moderator: Courtney
Mazurek, Michelle L., et al. Access control for home data sharing: Attitudes, needs and practices. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 2010. Moderator: Lirong
Buçinca, Zana, et al. AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms. arXiv preprint arXiv:2306.03280 (2023). Moderator: Nadia
Khattab, Omar, et al. Dspy: Compiling declarative language model calls into self-improving pipelines. arXiv preprint arXiv:2310.03714 (2023). Moderator: Chenyang
Unpublished paper draft. Moderator: Christian
Liu, Chengwei, et al. Demystifying the vulnerability propagation and its evolution via dependency trees in the npm ecosystem. Proceedings of the 44th International Conference on Software Engineering. 2022. Moderator: Hao