Skip to content

Feature Request: Integrate FunASR / SenseVoice as ASR engine #2178

@LauraGPT

Description

@LauraGPT

Summary

Would love to see FunASR and SenseVoice supported as ASR engines in the TEN Framework for conversational voice AI agents.

Why FunASR / SenseVoice?

  • Real-time streaming ASR — FunASR has native streaming support with punctuation restoration and inverse text normalization, ideal for conversational agents.
  • Ultra-low latency — SenseVoice-Small processes 10 seconds of audio in ~70ms, enabling responsive voice interactions.
  • 50+ languages — Multilingual support out of the box.
  • Fully offline — No API calls needed, runs locally with Apache 2.0 license.
  • Industrial-grade — Used in production by Alibaba Cloud and 100K+ installations.

Key Models

Model Specialty HuggingFace
Paraformer High-accuracy non-autoregressive ASR iic/speech_paraformer
SenseVoice Multi-task (ASR + emotion + events) FunAudioLLM/SenseVoiceSmall
Fun-ASR-Nano End-to-end, HuggingFace native FunAudioLLM/Fun-ASR-Nano

References

Happy to help with integration!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions