Pete Warden and team just published a paper on Moonshine, their speech to text model.
Key features include:
- 1.7x overall speed boost compared to Whisper
- Flexible-sized input window, allowing for more efficient processing of shorter audio clips
- Up to 5x faster performance on 10-second audio clips
- Matches or exceeds Whisper's accuracy
Pete Warden and team just published a paper on Moonshine, their speech to text model.
Key features include:
- 1.7x overall speed boost compared to Whisper - Flexible-sized input window, allowing for more efficient processing of shorter audio clips - Up to 5x faster performance on 10-second audio clips - Matches or exceeds Whisper's accuracy