Skip to content
Change the repository type filter

All

    Repositories list

    • Code for `LLM2VEC-GEN: Generative Embeddings from Large Language Models`
      Python
      MIT License
      02700Updated Mar 12, 2026Mar 12, 2026
    • Python
      25163Updated Mar 10, 2026Mar 10, 2026
    • Data and code for the paper "Humans and LLMs Diverge on Probabilistic Inferences"
      Jupyter Notebook
      MIT License
      1110Updated Mar 4, 2026Mar 4, 2026
    • Code and data for the paper "LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs"
      Python
      Other
      23200Updated Mar 3, 2026Mar 3, 2026
    • Workshop on AI for Science
      HTML
      0000Updated Feb 17, 2026Feb 17, 2026
    • BRIDGE

      Public
      BRIDGE: Predicting Human Task Completion Time From Model Performance
      HTML
      2100Updated Feb 10, 2026Feb 10, 2026
    • Code for `Exploiting Instruction-Following Retrievers for Malicious Information Retrieval`
      Python
      MIT License
      1600Updated Jan 8, 2026Jan 8, 2026
    • Python
      0000Updated Jan 5, 2026Jan 5, 2026
    • Jupyter Notebook
      MIT License
      21200Updated Dec 8, 2025Dec 8, 2025
    • llm2vec

      Public
      Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
      Python
      MIT License
      1341.7k384Updated Dec 4, 2025Dec 4, 2025
    • TypeScript
      0000Updated Nov 29, 2025Nov 29, 2025
    • TypeScript
      0000Updated Nov 29, 2025Nov 29, 2025
    • TypeScript
      0000Updated Nov 18, 2025Nov 18, 2025
    • mSTEB

      Public
      Jupyter Notebook
      1100Updated Nov 14, 2025Nov 14, 2025
    • Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
      Python
      Apache License 2.0
      2734520Updated Nov 13, 2025Nov 13, 2025
    • MAGNIFICo

      Public
      EMNLP 2023: MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
      Python
      0200Updated Nov 7, 2025Nov 7, 2025
    • Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
      Jupyter Notebook
      MIT License
      5460141Updated Oct 7, 2025Oct 7, 2025
    • Python
      0810Updated Oct 3, 2025Oct 3, 2025
    • llmsafety

      Public
      A fork of JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
      Python
      MIT License
      61000Updated Oct 1, 2025Oct 1, 2025
    • 0000Updated Sep 22, 2025Sep 22, 2025
    • Python
      3200Updated Sep 10, 2025Sep 10, 2025
    • ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
      Python
      4115400Updated Aug 18, 2025Aug 18, 2025
    • TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models
      Python
      MIT License
      21900Updated Aug 17, 2025Aug 17, 2025
    • AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
      Python
      24010Updated Aug 7, 2025Aug 7, 2025
    • R
      MIT License
      0200Updated Jul 30, 2025Jul 30, 2025
    • AfroBench

      Public
      Large Scale Benchmark of Large Language Models on African Languages
      Python
      31700Updated Jul 28, 2025Jul 28, 2025
    • AURORA

      Public
      Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation
      Python
      MIT License
      23500Updated Jun 30, 2025Jun 30, 2025
    • Python
      0000Updated Jun 22, 2025Jun 22, 2025
    • VinePPO

      Public
      Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
      Python
      MIT License
      2318740Updated May 25, 2025May 25, 2025
    • Evaluation dataset for our NAACL 2025 paper on "Does Generative AI speak Nigerian-Pidgin?: Issues about Representativeness and Bias for Multilingualism in LLMs"
      Apache License 2.0
      0000Updated May 14, 2025May 14, 2025