Skip to content
Change the repository type filter

All

    Repositories list

    • MOSS-TTS

      Public
      MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressive…
      Python
      Apache License 2.0
      2542.8k52Updated Jun 2, 2026Jun 2, 2026
    • Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.
      Python
      2921980Updated Jun 2, 2026Jun 2, 2026
    • A real-time video understanding foundation model with gated cross-attention. Offline & real-time inference.
      Python
      Apache License 2.0
      413800Updated Jun 1, 2026Jun 1, 2026
    • MOSS-VL

      Public
      MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
      Python
      Apache License 2.0
      425600Updated Jun 1, 2026Jun 1, 2026
    • MOSS

      Public
      An open-source tool-augmented conversational language model from Fudan University
      Python
      Apache License 2.0
      1.1k12k2356Updated May 27, 2026May 27, 2026
    • .github

      Public
      0000Updated May 27, 2026May 27, 2026
    • MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for real…
      Python
      Apache License 2.0
      4323.3k455Updated May 26, 2026May 26, 2026
    • A curated, continuously updated reading list, paper blogs, and resources for World Action Models (WAMs) in embodied AI.
      HTML
      MIT License
      1663312Updated May 24, 2026May 24, 2026
    • MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenar…
      Python
      36511110Updated May 19, 2026May 19, 2026
    • sglang

      Public
      Python
      Apache License 2.0
      0300Updated May 12, 2026May 12, 2026
    • Vue
      0500Updated May 11, 2026May 11, 2026
    • MOSS-Music is an open-source music understanding model for targeting musical captioning, lyrics ASR, structural analysis, chord / key / tempo reasoning, and lon…
      Python
      58320Updated May 9, 2026May 9, 2026
    • JavaScript
      54600Updated May 7, 2026May 7, 2026
    • MOVA

      Public
      MOVA: Towards Scalable and Synchronized Video–Audio Generation
      Python
      Apache License 2.0
      871k333Updated May 6, 2026May 6, 2026
    • MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming an…
      Python
      Apache License 2.0
      1521831Updated May 6, 2026May 6, 2026
    • mlx-audio

      Public
      A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Sil…
      Python
      MIT License
      614600Updated Apr 27, 2026Apr 27, 2026
    • CSS
      1100Updated Apr 13, 2026Apr 13, 2026
    • llama.cpp

      Public
      C++
      MIT License
      2502Updated Apr 8, 2026Apr 8, 2026
    • BandPO

      Public
      Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning. BandPO replaces canoni…
      Python
      GNU General Public License v3.0
      44900Updated Apr 8, 2026Apr 8, 2026
    • Python
      0700Updated Apr 3, 2026Apr 3, 2026
    • A library for mechanistic interpretability of GPT-style language models
      Python
      MIT License
      580200Updated Mar 31, 2026Mar 31, 2026
    • Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
      Python
      Apache License 2.0
      12704Updated Mar 30, 2026Mar 30, 2026
    • DiRL

      Public
      Python
      Apache License 2.0
      716001Updated Mar 30, 2026Mar 30, 2026
    • OurClaw

      Public
      Institutional OpenClaw Solution. Share One Claw with Others.
      TypeScript
      MIT License
      32400Updated Mar 30, 2026Mar 30, 2026
    • RoboOmni

      Public
      Official code of "RoboOmni: Proactive Robot Manipulation in Omni-modal Context"
      Python
      610960Updated Mar 28, 2026Mar 28, 2026
    • MOSS-TTSD

      Public
      MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, a…
      Python
      Apache License 2.0
      1311.3k520Updated Mar 23, 2026Mar 23, 2026
    • TTSD-eval

      Public
      Python
      0400Updated Mar 16, 2026Mar 16, 2026
    • JavaScript
      0200Updated Mar 3, 2026Mar 3, 2026
    • Website

      Public
      wangye
      JavaScript
      3001Updated Mar 2, 2026Mar 2, 2026
    • FRoM-W1

      Public
      [ArXiv 26] FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions
      Python
      Apache License 2.0
      716630Updated Feb 13, 2026Feb 13, 2026
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.