Microsoft.ML.OnnxRuntimeGenAI 0.6.0

Prefix Reserved

There is a newer prerelease version of this package available.
See the version list below for details.

dotnet add package Microsoft.ML.OnnxRuntimeGenAI --version 0.6.0

NuGet\Install-Package Microsoft.ML.OnnxRuntimeGenAI -Version 0.6.0

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="Microsoft.ML.OnnxRuntimeGenAI" Version="0.6.0" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

paket add Microsoft.ML.OnnxRuntimeGenAI --version 0.6.0

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: Microsoft.ML.OnnxRuntimeGenAI, 0.6.0"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

// Install Microsoft.ML.OnnxRuntimeGenAI as a Cake Addin
#addin nuget:?package=Microsoft.ML.OnnxRuntimeGenAI&version=0.6.0

// Install Microsoft.ML.OnnxRuntimeGenAI as a Cake Tool
#tool nuget:?package=Microsoft.ML.OnnxRuntimeGenAI&version=0.6.0

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

About

Run Llama, Phi (Language + Vision!), Gemma, Mistral with ONNX Runtime.

This API gives you an easy, flexible and performant way of running LLMs on device using .NET/C#.

It implements the generative AI loop for ONNX models, including pre and post processing, inference with ONNX Runtime, logits processing, search and sampling, and KV cache management.

You can call a high level generate() method to generate all of the output at once, or stream the output one token at a time.

Key Features

Language and vision pre and post processing
Inference using ONNX Runtime
Generation tuning with greedy, beam search and random sampling
KV cache management to optimize performance
Multi target execution (CPU, GPU, with NPU coming!)

Sample

// See https://aka.ms/new-console-template for more information
using Microsoft.ML.OnnxRuntimeGenAI;

using OgaHandle ogaHandle = new OgaHandle();

// Specify the location of your downloaded model.
// Many models are published on HuggingFace e.g. 
// https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-onnx
string modelPath = "..."
Console.WriteLine("Model path: " + modelPath);

using Model model = new Model(modelPath);
using Tokenizer tokenizer = new Tokenizer(model);

// Set your prompt here
string prompt = "public static bool IsPrime(int number)";
var sequences = tokenizer.Encode($"<|user|>{prompt}<|end|><|assistant|>");

using GeneratorParams generatorParams = new GeneratorParams(model);
generatorParams.SetSearchOption("max_length", 512);
generatorParams.SetInputSequences(sequences);

using var tokenizerStream = tokenizer.CreateStream();
using var generator = new Generator(model, generatorParams);
while (!generator.IsDone())
{
    generator.ComputeLogits();
    generator.GenerateNextToken();
    Console.Write(tokenizerStream.Decode(generator.GetSequence(0)[^1]));
}

Generates the following output:

Here's a complete implementation of the `IsPrime` function in C# that checks if a given number is prime. The function includes basic input validation and comments for clarity.

using System;

namespace PrimeChecker
{
    public class PrimeChecker
    {
        /// <summary>
        /// Checks if the given number is prime.
        /// </summary>
        /// <param name="number">The number to check.</param>
        /// <returns>true if the number is prime; otherwise, false.</returns>
        public static bool IsPrime(int number)
        {
            // Input validation
            if (number < 2)
            {
                return false;
            }

            // 2 is the only even prime number
            if (number == 2)
            {
                return true;
            }

            // Exclude even numbers greater than 2
            if (number % 2 == 0)
            {
                return false;
            }

            // Check for factors up to the square root of the number
            int limit = (int)Math.Floor(Math.Sqrt(number));
            for (int i = 3; i <= limit; i += 2)
            {
                if (number % i == 0)
                {
                    return false;
                }
            }

            return true;
        }

        static void Main(string[] args)
        {
            int number = 29;
            bool isPrime = PrimeChecker.IsPrime(number);

            Console.WriteLine($"Is {number} prime? {isPrime}");
        }
    }
}

This implementation checks if a number is prime by iterating only up to the square root of the number, which is an optimization over checking all numbers up to the number itself. It also excludes even numbers greater than 2, as they cannot be prime.

Source code repository

ONNX Runtime is an open source project. See:

(ONNX Runtime)[https://github.com/microsoft/onnxruntime]
(ONNX Runtime GenAI)[https://github.com/microsoft/onnxruntime-genai]

Documentation

See (ONNX Runtime GenAI Documentation)[https://onxxruntime.ai/docs/genai]

Product	Compatible and additional computed target framework versions.
.NET	net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 is compatible. net8.0-android was computed. net8.0-android31.0 is compatible. net8.0-browser was computed. net8.0-ios was computed. net8.0-ios15.4 is compatible. net8.0-maccatalyst was computed. net8.0-maccatalyst14.0 is compatible. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 was computed. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed.
.NET Core	netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed.
.NET Standard	netstandard2.0 is compatible. netstandard2.1 was computed.
.NET Framework	net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed.
MonoAndroid	monoandroid was computed.
MonoMac	monomac was computed.
MonoTouch	monotouch was computed.
native	native is compatible.
Tizen	tizen40 was computed. tizen60 was computed.
Xamarin.iOS	xamarinios was computed.
Xamarin.Mac	xamarinmac was computed.
Xamarin.TVOS	xamarintvos was computed.
Xamarin.WatchOS	xamarinwatchos was computed.

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

.NETCoreApp 0.0
- Microsoft.ML.OnnxRuntime (>= 1.20.1)
- Microsoft.ML.OnnxRuntimeGenAI.Managed (>= 0.6.0)
.NETFramework 0.0
- Microsoft.ML.OnnxRuntime (>= 1.20.1)
- Microsoft.ML.OnnxRuntimeGenAI.Managed (>= 0.6.0)
.NETStandard 0.0
- Microsoft.ML.OnnxRuntime (>= 1.20.1)
- Microsoft.ML.OnnxRuntimeGenAI.Managed (>= 0.6.0)
net8.0-android31.0
- Microsoft.ML.OnnxRuntime (>= 1.20.1)
- Microsoft.ML.OnnxRuntimeGenAI.Managed (>= 0.6.0)
net8.0-ios15.4
- Microsoft.ML.OnnxRuntime (>= 1.20.1)
- Microsoft.ML.OnnxRuntimeGenAI.Managed (>= 0.6.0)
net8.0-maccatalyst14.0
- Microsoft.ML.OnnxRuntime (>= 1.20.1)
- Microsoft.ML.OnnxRuntimeGenAI.Managed (>= 0.6.0)

NuGet packages (4)

Showing the top 4 NuGet packages that depend on Microsoft.ML.OnnxRuntimeGenAI:

Package	Downloads
Microsoft.SemanticKernel.Connectors.Onnx Semantic Kernel connectors for the ONNX runtime. Contains clients for text embedding generation.	57.1K
Microsoft.KernelMemory.AI.Onnx Provide access to ONNX LLM models in Kernel Memory to generate text	26.9K
feiyun0112.SemanticKernel.Connectors.OnnxRuntimeGenAI.CPU Semantic Kernel connector for Microsoft.ML.OnnxRuntimeGenAI.	1.9K
Richasy.AgentKernel.Connectors.Onnx Agent Kernel connectors for Onnx.	583

GitHub repositories (3)

Showing the top 3 popular GitHub repositories that depend on Microsoft.ML.OnnxRuntimeGenAI:

Repository	Stars
microsoft/semantic-kernel Integrate cutting-edge LLM technology quickly and easily into your apps	23.4K
microsoft/kernel-memory RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.	1.8K
microsoft/ai-dev-gallery An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.	897

Version	Downloads	Last updated
0.7.0-rc1	2,256	13 days ago
0.6.0	989	a month ago
0.6.0-rc4	275	a month ago
0.6.0-rc1	262	2 months ago
0.5.2	51,439	3 months ago
0.5.1	755	4 months ago
0.5.0	653	4 months ago
0.4.0	47,542	7 months ago
0.4.0-rc1	214	7 months ago
0.3.0	27,213	9 months ago
0.3.0-rc2	2,037	9 months ago
0.3.0-rc1	209	10 months ago
0.2.0	597	10 months ago
0.2.0-rc7	289	10 months ago
0.2.0-rc6	207	5/4/2024
0.2.0-rc4	832	4/25/2024
0.2.0-rc3	175	4/24/2024
0.1.0	479	4/8/2024
0.1.0-rc4	236	3/27/2024

Release Def:
Branch: refs/heads/rel-0.6.0
Commit: 97d44f6cc438e41c8876ff92716c628519d62ba6

Microsoft.ML.OnnxRuntimeGenAI 0.6.0

About

Key Features

Sample

Source code repository

Documentation

.NETCoreApp 0.0

.NETFramework 0.0

.NETStandard 0.0

net8.0-android31.0

net8.0-ios15.4

net8.0-maccatalyst14.0

NuGet packages (4)

GitHub repositories (3)