Skip to content
Change the repository type filter

All

    Repositories list

    • data-juicer

      Public
      Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
      Python
      Apache License 2.0
      3616.3k3524Updated Apr 22, 2026Apr 22, 2026
    • 🤖 Your Intelligent Copilot for Data Exploration and Processing Pipeline
      Python
      Apache License 2.0
      83510Updated Apr 20, 2026Apr 20, 2026
    • data-juicer-sphinx

      Public
      Apache License 2.0
      1000Updated Mar 12, 2026Mar 12, 2026
    • data-juicer-hub

      Public
      Community-driven data-juicer recipes and best practices for various pre-training/fine-tuning tasks.
      Apache License 2.0
      31000Updated Feb 12, 2026Feb 12, 2026
    • A Feedback-Driven Suite for Multimodal Data-Model Co-development.
      Python
      Apache License 2.0
      5500Updated Jan 15, 2026Jan 15, 2026
    • recognize-anything

      Public
      Open-source and strong foundation image recognition models. Self-modified version.
      Jupyter Notebook
      Apache License 2.0
      326000Updated Jan 12, 2026Jan 12, 2026
    • transformers-stream-generator

      Public
      This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/Transformers. Self-modi…
      Python
      MIT License
      20000Updated Nov 15, 2025Nov 15, 2025
    • .github

      Public
      0000Updated Nov 5, 2025Nov 5, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.