Skip to content
@OpenMOSS

OpenMOSS (SII)

OpenMOSS Team is a research group under the Shanghai Innovation Institution (SII), working in close collaboration with Fudan University and MOSI Intelligence.

Introduction 👋

OpenMOSS Team is a research group under the Shanghai Innovation Institution (SII), working in close collaboration with Fudan University and MOSI Intelligence. Led by Prof. Xipeng Qiu, the team conducts cutting-edge research on large language models (LLMs), advancing the frontiers of model architecture, evaluation, and application with a strong commitment to open, collaborative, and impactful AI innovation.

We warmly welcome researchers, students, and collaborators who share our vision to join us in pushing the boundaries of LLM technology. For inquiries or collaboration opportunities, please contact us at openmoss@sii.edu.cn .

🌐 Website: https://openmoss.github.io/ or http://openmoss.sii.edu.cn/

💻 GitHub: https://github.com/OpenMOSS

  • SII is dedicated to fostering innovation in education and research in the field of artificial intelligence.

Pinned Loading

  1. MOSS MOSS Public

    An open-source tool-augmented conversational language model from Fudan University

    Python 12.1k 1.1k

  2. MOSS-VL MOSS-VL Public

    MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

    Python 219 4

  3. MOSS-TTS MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

    Python 1.5k 135

  4. MOVA MOVA Public

    MOVA: Towards Scalable and Synchronized Video–Audio Generation

    Python 950 77

  5. MOSS-TTS-Nano MOSS-TTS-Nano Public

    MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…

    Python 1.3k 146

  6. MOSS-Audio MOSS-Audio Public

    MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenarios.

    Python 108 3

Repositories

Showing 10 of 50 repositories
  • Language-Model-SAEs Public

    Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.

    OpenMOSS/Language-Model-SAEs’s past year of commit activity
    Python 212 28 8 0 Updated Apr 16, 2026
  • MOSS-Audio Public

    MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenarios.

    OpenMOSS/MOSS-Audio’s past year of commit activity
    Python 108 3 1 0 Updated Apr 16, 2026
  • mlx-audio Public Forked from Blaizzy/mlx-audio

    A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

    OpenMOSS/mlx-audio’s past year of commit activity
    Python 5 MIT 561 0 0 Updated Apr 16, 2026
  • MOSS-TTS-Nano Public

    MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run directly on CPU without a GPU, and keeps the deployment stack simple enough for local demos, web serving, and lightweight product integration.

    OpenMOSS/MOSS-TTS-Nano’s past year of commit activity
    Python 1,274 Apache-2.0 146 19 2 Updated Apr 16, 2026
  • OpenMOSS/MOSS-TTS-Nano-Reader’s past year of commit activity
    JavaScript 19 1 0 0 Updated Apr 14, 2026
  • MOSS-VL Public

    MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

    OpenMOSS/MOSS-VL’s past year of commit activity
    Python 219 Apache-2.0 4 0 0 Updated Apr 14, 2026
  • sglang Public
    OpenMOSS/sglang’s past year of commit activity
    Python 3 Apache-2.0 0 0 0 Updated Apr 14, 2026
  • MOSS-Audio-Tokenizer Public

    MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA reconstruction and strong performance in generation and understanding—serving as a unified interface for next-generation native audio language models.

    OpenMOSS/MOSS-Audio-Tokenizer’s past year of commit activity
    Python 191 Apache-2.0 12 3 1 Updated Apr 13, 2026
  • OpenMOSS/MOSS-TTS-Nano-Demo’s past year of commit activity
    CSS 1 1 0 0 Updated Apr 13, 2026
  • MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

    OpenMOSS/MOSS-TTS’s past year of commit activity
    Python 1,470 Apache-2.0 135 23 1 Updated Apr 13, 2026

Top languages

Loading…

Most used topics

Loading…