zhangshuibai/ComputeScaling-Replication

ComputeScaling-Replication

Replication of part of the Hugging Face blog post on scaling test-time compute: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute

Because the blog post does not describe its grading implementation in enough detail to reproduce its results, I adapted the grading code from https://github.com/openai/prm800k

Replication of math-psa (https://huggingface.co/openreasoner/Math-psa/tree/main)

Using "last" as the aggregation method: replication results of math-psa

Using "mean" as the aggregation method: replication results of math-psa

Using "min" as the aggregation method: replication results of math-psa
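For readers unfamiliar with the three aggregation methods, here is a minimal sketch of how they might reduce the per-step scores of a process reward model to a single score per solution, and how that score could then pick the best of N candidates. It assumes the scores are plain floats, one per reasoning step. The function names `aggregate_step_scores` and `best_of_n` are hypothetical and are not taken from this repository's code.

```python
from typing import List, Sequence


def aggregate_step_scores(step_scores: Sequence[float], method: str = "last") -> float:
    """Reduce per-step reward scores to one score for the whole solution.

    Hypothetical helper for illustration; not this repository's implementation.
    """
    if not step_scores:
        raise ValueError("step_scores must contain at least one score")
    if method == "last":
        # Score of the final step only.
        return step_scores[-1]
    if method == "mean":
        # Average score over all steps.
        return sum(step_scores) / len(step_scores)
    if method == "min":
        # Score of the weakest step.
        return min(step_scores)
    raise ValueError(f"unknown aggregation method: {method!r}")


def best_of_n(candidates: List[Sequence[float]], method: str = "last") -> int:
    """Return the index of the candidate with the highest aggregated score."""
    scores = [aggregate_step_scores(c, method) for c in candidates]
    return max(range(len(scores)), key=scores.__getitem__)


# Made-up step scores for three candidate solutions to one problem.
candidates = [
    [0.9, 0.9, 0.3],     # strong start, weak final step
    [0.5, 0.5, 0.6],     # mediocre steps, best final step
    [0.55, 0.55, 0.55],  # uniform steps, best worst-case step
]

for method in ("last", "mean", "min"):
    print(method, best_of_n(candidates, method))
```

With these made-up scores each method selects a different candidate, which is why the results are reported separately per aggregation method.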
