Skip to content
Change the repository type filter

All

    Repositories list

    • [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving
      Python
      12500Updated Apr 6, 2026Apr 6, 2026
    • TROVE

      Public
      [ACL 2025] TROVE: A Challenge for Fine-Grained Text Provenance via Source Sentence Tracing and Relationship Classification
      Python
      0400Updated Dec 23, 2025Dec 23, 2025
    • KTAE

      Public
      NeurIPS 2025
      Python
      Apache License 2.0
      0110Updated Oct 27, 2025Oct 27, 2025
    • LR2Bench

      Public
      [ACL 2025 Findings] LR^2Bench: Evaluating Long-chain Reflective Reasoning Capabilities of Large Language Models via Constraint Satisfaction Problems
      Python
      0400Updated Sep 30, 2025Sep 30, 2025
    • [ACL 2025 Findings] Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment
      Python
      1700Updated Aug 25, 2025Aug 25, 2025
    • 33110Updated Jul 23, 2025Jul 23, 2025
    • LADM

      Public
      [ACL 2025] LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
      Python
      0900Updated Jun 12, 2025Jun 12, 2025
    • TokAlign

      Public
      [ACL 2025] TokAlign: Efficient Vocabulary Adaptation via Token Alignment
      Python
      MIT License
      21120Updated Jun 4, 2025Jun 4, 2025
    • DMoE

      Public
      [ACL 2025 Findings] Group then Scale: Dynamic Mixture-of-Experts Multilingual Language Model
      Python
      MIT License
      0300Updated Jun 2, 2025Jun 2, 2025
    • [ACL 2024 Findings] Official code and data for ACL-2024 paper "X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual …
      Python
      MIT License
      1900Updated Jun 19, 2024Jun 19, 2024
    • BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
      Python
      1322900Updated Nov 21, 2023Nov 21, 2023
    • Official code for EMNLP-2023 paper "Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning"
      Python
      MIT License
      0500Updated Oct 11, 2023Oct 11, 2023
    • Python
      MIT License
      2700Updated Oct 18, 2022Oct 18, 2022
    • ATSum

      Public
      The code for ACL2020 paper "Attend, Translate and Summarize: An Efficient Method for Neural Cross-Lingual Summarization"
      Roff
      Other
      44700Updated Dec 18, 2020Dec 18, 2020
    • Datasets for EMNLP-IJCNLP 2019 paper "NCLS:Neural Cross-Lingual Summarization"
      Python
      Other
      97400Updated Nov 18, 2020Nov 18, 2020
    • SOTA-MT

      Public
      This project attempts to maintain the SOTA performance in machine translation
      1110801Updated Sep 21, 2020Sep 21, 2020
    • ASN

      Public
      Dataset for EMNLP-2019 (Attribute-aware Sequence Network for Review Summarization) paper
      1000Updated Aug 27, 2019Aug 27, 2019
    • sb-nmt

      Public
      Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)
      Python
      166600Updated May 16, 2019May 16, 2019
    • rnmt

      Public
      A C++/CUDA toolkit for neural machine translation (RNN-Based NMT) across multiple GPUs
      C++
      1000Updated May 11, 2019May 11, 2019
    • A C++/CUDA toolkit for Transformer (NMT) Translator (Decoder)
      C++
      5000Updated Jan 7, 2019Jan 7, 2019
    • A simple TensorFlow implementation of the Transformer
      Python
      6000Updated Jan 7, 2019Jan 7, 2019
    • NewsCrawler for Xinhua news.
      Python
      34000Updated Dec 25, 2018Dec 25, 2018
    • USN

      Public
      Dataset for AAAI-2019 paper
      3000Updated Dec 10, 2018Dec 10, 2018
    • HUARN

      Public
      Code for the COLING 2018 paper "Document-level Multi-aspect Sentiment Classification by Jointly Modeling Users, Aspects, and Overall Ratings"
      Python
      MIT License
      5000Updated Dec 10, 2018Dec 10, 2018
    • Python
      1100Updated Aug 13, 2018Aug 13, 2018
    • code and data
      Python
      3100Updated Nov 1, 2017Nov 1, 2017
    • Tallip paper code data
      Roff
      2100Updated Feb 8, 2017Feb 8, 2017
    • A C++ toolkit for neural machine translation for CPU
      C++
      25000Updated Sep 2, 2016Sep 2, 2016
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.