We evaluated the performance of various Large Language Models (LLMs) in generating Data Management Plans (DMPs) that comply with National Institutes of Health (NIH) requirements. This evaluation drew on two analysis datasets: DMP_Automatic_Evaluation_Analysis and DMP_Human_Evaluation_Analysis.
In Phase 1, we will explore strategies for improving LLM performance on this task, such as Retrieval-Augmented Generation (RAG) and prompt engineering. We will also begin building dmpchef.org, which in this phase will support only requests for NIH DMP drafts. By the end of Phase 1, we expect to learn how much LLM output quality can improve when the system is tuned toward a specific DMP-generation task (here, NIH DMPs).
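The RAG strategy mentioned above can be sketched as follows: retrieve the guideline snippets most relevant to a user's request, then prepend them to the prompt given to the LLM. This is a minimal illustration only; the snippet texts, function names, and keyword-overlap retriever are placeholder assumptions (a production system would use embedding-based retrieval and the actual NIH guidance text), not the dmpchef.org implementation.

```python
from collections import Counter

# Illustrative stand-ins for retrievable NIH DMP guidance passages.
GUIDELINE_SNIPPETS = [
    "Element 1: Data Type. Describe the data types and estimated amounts of scientific data to be generated.",
    "Element 2: Related Tools, Software and/or Code. State whether specialized tools are needed to access the data.",
    "Element 4: Data Preservation, Access, and Associated Timelines. Name the repository where data will be archived.",
]

def score(query: str, doc: str) -> int:
    """Crude keyword-overlap score; a real system would use embeddings."""
    q = Counter(query.lower().split())
    d = Counter(doc.lower().split())
    return sum((q & d).values())

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k snippets most relevant to the query."""
    ranked = sorted(GUIDELINE_SNIPPETS, key=lambda s: score(query, s), reverse=True)
    return ranked[:k]

def build_prompt(request: str) -> str:
    """Assemble the augmented prompt: retrieved guidance + user request."""
    context = "\n".join(retrieve(request))
    return f"NIH DMP guidance:\n{context}\n\nDraft a DMP section for: {request}"

prompt = build_prompt("Which repository will preserve the genomic data")
```

Here the request about repositories pulls in the data-preservation element before the prompt is assembled, so the model drafts against the relevant guidance rather than from memory alone.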