EchoSharp.OpenAI.Whisper 0.1.0

dotnet add package EchoSharp.OpenAI.Whisper --version 0.1.0                
NuGet\Install-Package EchoSharp.OpenAI.Whisper -Version 0.1.0                
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="EchoSharp.OpenAI.Whisper" Version="0.1.0" />                
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add EchoSharp.OpenAI.Whisper --version 0.1.0                
#r "nuget: EchoSharp.OpenAI.Whisper, 0.1.0"                
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install EchoSharp.OpenAI.Whisper as a Cake Addin
#addin nuget:?package=EchoSharp.OpenAI.Whisper&version=0.1.0

// Install EchoSharp.OpenAI.Whisper as a Cake Tool
#tool nuget:?package=EchoSharp.OpenAI.Whisper&version=0.1.0                

EchoSharp

EchoSharp is an open-source library designed for near-real-time audio processing, orchestrating different AI models seamlessly for various audio analysis scopes. With an architecture that focuses on flexibility and performance, EchoSharp allows near-real-time Transcription and Translation by integrating components for Speech-to-Text and Voice Activity Detection.

Key Features

  • Near-Real-Time Audio Processing: Handle audio data with minimal latency, ensuring efficient near-real-time results.
  • Interchangeable Components: Customize or extend the library by building your own components for speech-to-text or voice activity detection. EchoSharp exposes flexible interfaces, making integration straightforward.
  • Easy Orchestration: Manage and coordinate different AI models effectively for specific audio analysis tasks, like transcribing and detecting speech in various environments.

Get Started

Get started with EchoSharp and explore how adaptable, near-real-time audio processing can transform your projects.

You can find the latest EchoSharp version on nuget at: EchoSharp

First-Party components

EchoSharp.Whisper.net

EchoSharp.Whisper.net is a Speech-to-Text (STT) component built on top of Whisper.net, providing high-quality transcription and translation capabilities in a near-real-time setting. Leveraging the state-of-the-art Whisper models from OpenAI, this component ensures robust performance for processing audio input with impressive accuracy across multiple languages. It's designed to be highly efficient and easily interchangeable, allowing developers to customize or extend it with alternative STT components if desired.

Key Features:

  • Multilingual Transcription: Supports transcription in multiple languages, with automatic detection and translation capabilities.
  • Customizable Integration: Plug-and-play design that integrates seamlessly with EchoSharp's audio orchestration.
  • Local Inference: Perform inference locally, ensuring data privacy and reducing latency for near-real-time processing.

EchoSharp.Onnx.SileroVad

EchoSharp.Onnx.SileroVad is a Voice Activity Detection (VAD) component that uses Silero VAD to distinguish between speech and non-speech segments in audio streams. By efficiently detecting voice activity, this component helps manage and optimize audio processing pipelines, activating transcription only when necessary to reduce overhead and improve overall performance.

Key Features:

  • Accurate Voice Detection: Reliably identifies when speech is present, even in noisy environments.
  • Resource Efficiency: Minimizes unnecessary processing by filtering out silent or irrelevant audio segments.
  • Flexible Configuration: Easily adjustable settings to fine-tune voice detection thresholds based on specific use cases.

EchoSharp.OpenAI.Whisper

EchoSharp.OpenAI.Whisper is a Speech-to-Text (STT) component that leverages the OpenAI Whisper API.

Key Features:

  • High-Quality Transcription: Utilizes the OpenAI Whisper API to provide accurate and reliable speech-to-text conversion.
  • Azure or OpenAI APIs: Choose between Azure or OpenAI APIs for transcription based on your requirements. (just provide the AudioClient from OpenAI SDK or Azure SDK)
  • Customizable Integration: Easily integrate with EchoSharp's audio orchestration for seamless audio processing.

EchoSharp.AzureAI.SpeechServices

EchoSharp.AzureAI.SpeechServices is a Speech-to-Text (STT) component that uses the Azure Speech Services API.

Key Features:

  • Azure Speech Services Integration: Leverage the Azure Speech Services API for high-quality speech-to-text conversion.
  • Real-Time Transcription: Process audio data in near-real-time with minimal latency.
  • Customizable Configuration: Easily adjust settings and parameters to optimize transcription performance.

EchoSharp.WebRtc.WebRtcVadSharp

EchoSharp.WebRtc.WebRtcVadSharp is a Voice Activity Detection (VAD) component that uses the WebRTC VAD and WebRtcVadSharp algorithm to detect voice activity in audio streams. By accurately identifying speech segments, this component helps optimize audio processing pipelines, reducing unnecessary processing and improving overall efficiency.

Key Features:

  • Efficient Voice Detection: Detects voice activity with high accuracy, even in noisy environments.
  • Resource Optimization: Filters out silent or irrelevant audio segments to minimize processing overhead.
  • Flexible Configuration: Easily adjust settings to fine-tune voice detection OperatingMode based on specific use cases.

EchoSharp.Onnx.Whisper

Experimental - This component is still in development and may not be suitable for production use.

EchoSharp.Onnx.Whisper is a Speech-to-Text (STT) component that uses an ONNX model for speech recognition.

Key Features:

  • Customizable Speech Recognition: Utilize your own Whisper ONNX model for speech-to-text conversion.
  • Local Inference: Perform speech recognition locally, ensuring data privacy and reducing latency.
  • Flexible Integration: Seamlessly integrate with EchoSharp's audio processing pipeline for efficient audio analysis.

EchoSharp.Onnx.Sherpa

EchoSharp.Onnx.Sherpa is a Speech-to-Text (STT) component that uses multiple ONNX models for speech recognition. It integrates with this sherpa-onnx project and supports both OnlineModels and OfflineModels. Key Features:

  • Customizable Speech Recognition: Utilize your own ONNX models for speech-to-text conversion.
  • Local Inference: Perform speech recognition locally, ensuring data privacy and reducing latency.
  • Flexible Integration: Seamlessly integrate with EchoSharp's audio processing pipeline for efficient audio analysis.
Product Compatible and additional computed target framework versions.
.NET net5.0 was computed.  net5.0-windows was computed.  net6.0 was computed.  net6.0-android was computed.  net6.0-ios was computed.  net6.0-maccatalyst was computed.  net6.0-macos was computed.  net6.0-tvos was computed.  net6.0-windows was computed.  net7.0 was computed.  net7.0-android was computed.  net7.0-ios was computed.  net7.0-maccatalyst was computed.  net7.0-macos was computed.  net7.0-tvos was computed.  net7.0-windows was computed.  net8.0 is compatible.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed.  net9.0 is compatible.  net9.0-android was computed.  net9.0-browser was computed.  net9.0-ios was computed.  net9.0-maccatalyst was computed.  net9.0-macos was computed.  net9.0-tvos was computed.  net9.0-windows was computed. 
.NET Core netcoreapp2.0 was computed.  netcoreapp2.1 was computed.  netcoreapp2.2 was computed.  netcoreapp3.0 was computed.  netcoreapp3.1 was computed. 
.NET Standard netstandard2.0 is compatible.  netstandard2.1 was computed. 
.NET Framework net461 was computed.  net462 was computed.  net463 was computed.  net47 was computed.  net471 was computed.  net472 was computed.  net48 was computed.  net481 was computed. 
MonoAndroid monoandroid was computed. 
MonoMac monomac was computed. 
MonoTouch monotouch was computed. 
Tizen tizen40 was computed.  tizen60 was computed. 
Xamarin.iOS xamarinios was computed. 
Xamarin.Mac xamarinmac was computed. 
Xamarin.TVOS xamarintvos was computed. 
Xamarin.WatchOS xamarinwatchos was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
0.1.0 97 12/26/2024