Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices
Paper
•
2509.02523
•
Published
•
7
UsefulSensors/moonshine-tiny-ja の ONNX 版です。Transformers.js / onnxruntime-web で使用できます。
import {
MoonshineForConditionalGeneration,
AutoProcessor,
AutoTokenizer,
} from '@huggingface/transformers';
const MODEL_ID = 'wmoto-ai/moonshine-tiny-ja-ONNX';
const [model, processor, tokenizer] = await Promise.all([
MoonshineForConditionalGeneration.from_pretrained(MODEL_ID, { dtype: 'fp32' }),
AutoProcessor.from_pretrained(MODEL_ID),
AutoTokenizer.from_pretrained(MODEL_ID),
]);
// audioData: Float32Array (16kHz, mono)
const inputs = await processor(audioData);
const outputs = await model.generate({ ...inputs, max_new_tokens: 64 });
const text = tokenizer.decode(outputs[0], { skip_special_tokens: true });
attention_mask 入力を削除(Transformers.js互換のため内部生成に変更)重み自体は元モデルと同一です。
Moonshine AI Community License に準拠します。詳細は元モデルおよびライセンス全文を参照してください。
@misc{king2025flavorsmoonshinetinyspecialized,
title={Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices},
author={Evan King and Adam Sabra and Manjunath Kudlur and James Wang and Pete Warden},
year={2025},
eprint={2509.02523},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2509.02523}
}
Base model
UsefulSensors/moonshine-tiny-ja