YetAnotherStringMatcher 1.0.3

dotnet add package YetAnotherStringMatcher --version 1.0.3                
NuGet\Install-Package YetAnotherStringMatcher -Version 1.0.3                
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="YetAnotherStringMatcher" Version="1.0.3" />                
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add YetAnotherStringMatcher --version 1.0.3                
#r "nuget: YetAnotherStringMatcher, 1.0.3"                
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install YetAnotherStringMatcher as a Cake Addin
#addin nuget:?package=YetAnotherStringMatcher&version=1.0.3

// Install YetAnotherStringMatcher as a Cake Tool
#tool nuget:?package=YetAnotherStringMatcher&version=1.0.3                

YetAnotherStringMatcher

YetAnotherStringMatcher library tries to make string matching easier, more readable while being somewhat expressive.

It tries to replace Regex to some extent, mostly when it comes to relatively basic cases.

For now it is in very early version and apart from the errors, then it may lack of "obvious things" and be kinda inconsistent in its APIs

Source Code: https://github.com/xd-loler/YetAnotherStringMatcher/tree/master/src/Core

Tests: https://github.com/xd-loler/YetAnotherStringMatcher/blob/master/src/Tests/BasicTests.cs

How to install

  • .NET CLI: dotnet add package YetAnotherStringMatcher --version 1.0.3

  • Package Manager: Install-Package YetAnotherStringMatcher -Version 1.0.3

  • Package Reference: <PackageReference Include="YetAnotherStringMatcher" Version="1.0.3" />

  • Nuget.org URL: https://www.nuget.org/packages/YetAnotherStringMatcher/

Some random examples, more in specific API description below

// Polish Postal Code
var input = "12-345";

var result = new Matcher(input)
                 .MatchDigitsOfLength(2)
                 .Then("-")
                 .ThenDigitsOfLength(3)
                 .Check();

Assert.True(result.Success);
var matcher = new Matcher("[2021-09-05] ERROR: Message1! Exception!")
                  .Match("[2021-09-05]")
                  .ThenAnything()
                  .Then("Exception!")
                  .Check();
				  
Assert.True(matcher.Success);
var input = new List<string>
{
	"[2021-09-05] ERROR: Message1",
	"[2021-09-05] WARNING: Message1",
	"[2021-09-05] INFO: Message1",
	"[2021-09-07] WARNING: Message1",
};

var pattern = new Matcher()
		  .Match("[2021-09-05] ")
		  .ThenAnyOf("WARNING:", "ERROR:");

Assert.True(pattern.Check(input[0]).Success);
Assert.True(pattern.Check(input[1]).Success);

Assert.False(pattern.Check(input[2]).Success);
Assert.False(pattern.Check(input[3]).Success);
var matcher = new Matcher("Apple Watermelon")
                  .MatchAnyOf("Apple", "Banana")
                  .Then(" ")
                  .ThenAnyOf("Giraffe", "melon", "Watermelon")
                  .Check();

Assert.True(matcher.Success);
var matcher = new Matcher("TEST")
                  .Match("test").IgnoreCase()
                  .Check();

Assert.True(matcher.Success);
// Sample Phone Number / Reusable Pattern
var input = new List<string> { "+123 345 67 89", "+1424 345 67 89" };

var pattern = new Matcher()
                  .Match("+")
                  .ThenDigitsOfLength(3)
                  .Then(" ")
                  .ThenDigitsOfLength(3)
                  .Then(" ")
                  .ThenDigitsOfLength(2)
                  .Then(" ")
                  .ThenDigitsOfLength(2);

Assert.True(pattern.Check(input[0]).Success);
Assert.False(pattern.Check(input[1]).Success);
var result = new Matcher("Apple Pineapple")
                .Match("Apple ")
                .Then("Coconut ").IsOptional()
                .Then("Pineapple")
                .Check();

Assert.True(result.Success);
var str = @"RROR: duplicate key value violates unique...
            Detail: Key (some_column)=(b01a0e23-da71-3a08-9893-11b8b2dfb069) already exists.";

var check = new Matcher(str)
                .MatchAnything()
                .Then("duplicate key value violates unique")
                .ThenAnything()
                .Then("Detail: Key ")
                .ThenExtract().ExtractAs("output")
                .Then(" already exists.")
                .Check();

Assert.True(check.Success);
Assert.Equal("(some_column)=(b01a0e23-da71-3a08-9893-11b8b2dfb069)", check.ExtractedData["output"]);
var result = new Matcher("Apple Water_Banana Watermelon")
		 .Match("Apple")
		 .ThenAnything()
		 .Then("Water")
		 .Then("melon")
		 .Check();
				 
Assert.True(result.Success);

Avaliable APIs:

Match / Then - tries to match exact string

var matcher = new Matcher("ERROR: Exception 1...")
                  .Match("ERROR: Exception ")
                  .ThenAnyOf("1", "2", "3")
                  .Check();

Assert.True(matcher.Success);

MatchAnyOf / ThenAnyOf - tries to match longest possible element of list handed as parameter.

var matcher = new Matcher("Apple Watermelon")
                  .MatchAnyOf("Apple", "Banana")
                  .Then(" ")
                  .ThenAnyOf("Giraffe", "melon", "Watermelon")
                  .Check();

Assert.True(matcher.Success);

MatchAnything / ThenAnything - matches anything non empty

var matcher = new Matcher("[2021-09-05] ERROR: Message1! Exception!")
                  .Match("[2021-09-05]")
                  .ThenAnything()
                  .Then("Exception!")
                  .Check();
				  
Assert.True(matcher.Success);

This one will FAIL:

var matcher = new Matcher("12")
                  .Match("1")
                  .ThenAnything()
                  .Then("2")
                  .Check();
				  
Assert.False(matcher.Success);

ThenExtract().ExtractAs(...) - next symbols will be extracted and returned as element of dictionary with under key provided by ExtractAs

var str = @"RROR: duplicate key value violates unique...
            Detail: Key (some_column)=(b01a0e23-da71-3a08-9893-11b8b2dfb069) already exists.";

var check = new Matcher(str)
                .MatchAnything()
                .Then("duplicate key value violates unique")
                .ThenAnything()
                .Then("Detail: Key ")
                .ThenExtract().ExtractAs("output")
                .Then(" already exists.")
                .Check();

Assert.True(check.Success);
Assert.Equal("(some_column)=(b01a0e23-da71-3a08-9893-11b8b2dfb069)", check.ExtractedData["output"]);

MatchAnythingOfLength / ThenAnythingOfLength - matches anything that has expected length

var matcher = new Matcher("01-000 London")
                  .Match("01")
                  .ThenAnythingOfLength(5)
                  .Then("London")
                  .Check();
				  
Assert.True(matcher.Success);

MatchDigitsOfLength / ThenDigitsOfLength - matches digits of expected length

var matcher = new Matcher("01-000 London")
                  .MatchDigitsOfLength(2)
                  .Then("-")
                  .MatchDigitsOfLength(3)
                  .Check();
				  
Assert.True(matcher.Success);

This one will FAIL:

var matcher = new Matcher("aa-000 London")
                  .MatchDigitsOfLength(2)
                  .Then("-")
                  .MatchDigitsOfLength(3)
                  .Check();

Assert.False(matcher.Success);

MatchDigitsWithLengthBetween / ThenDigitsWithLengthBetween - matches digits with length between [A...B]

var input = new List<string> { "12-000 London", "2-00 Zurich" };

var pattern = new Matcher()
                  .MatchDigitsWithLengthBetween(1, 2)
                  .Then("-")
                  .MatchDigitsWithLengthBetween(1, 3);

Assert.True(pattern.Check(input[0]).Success);
Assert.True(pattern.Check(input[1]).Success);

This one will FAIL:

var input = new List<string> { "121-000 London", "44-0011 Zurich" };

var pattern = new Matcher()
                  .MatchDigitsWithLengthBetween(1, 2)
                  .Then("-") // because it isn't on 3rd position
                  .MatchDigitsWithLengthBetween(1, 3);

Assert.False(pattern.Check(input[0]).Success);

MatchSymbolsOfLength / ThenSymbolsOfLength - matches symbols that are provided

var input = new List<string> { "+-123" };

var pattern = new Matcher()
                  .MatchSymbolsOfLength(new[] { '+','-'}, 2)
                  .MatchDigitsWithLengthBetween(1, 3)
                  .NoMore();

Assert.True(pattern.Check(input[0]).Success);

This one will FAIL:

var input = new List<string> { "+-123" };

var pattern = new Matcher()
                  .MatchSymbolsOfLength(new[] { '!','@'}, 2)
                  .MatchDigitsWithLengthBetween(1, 3)
                  .NoMore();

Assert.False(pattern.Check(input[0]).Success);

MatchCustomOfLength / ThenCustomOfLength - matches symbols that satisfy given predicate

Func<char, CheckOptions, bool> func = (char c, CheckOptions _) => char.IsUpper(c);

var matcher = new Matcher("ABC")
                  .MatchCustomOfLength(func, 3)
                  .Check();

Assert.True(matcher.Success);

This one will FAIL:

Func<char, CheckOptions, bool> func = (char c, CheckOptions _) => char.IsUpper(c);

var matcher = new Matcher("abc")
                  .MatchCustomOfLength(func, 3)
                  .Check();

Assert.False(matcher.Success);

NoMore - string must end here

var matcher = new Matcher("TEST123")
                  .Match("TEST")
                  .NoMore()
                  .Check();

Assert.False(matcher.Success);

IsOptional - matching will not fail if this particular element is not present

var result = new Matcher("Apple Pineapple")
                .Match("Apple ")
                .Then("Coconut ").IsOptional()
                .Then("Pineapple")
                .Check();

Assert.True(result.Success);

IgnoreCase - Ignores Case

var matcher = new Matcher("TEST")
                  .Match("test").IgnoreCase()
                  .Check();

Assert.True(matcher.Success);

IgnoreCaseForAllExisting - Ignores Case for all previous requirements

var matcher = new Matcher("TEST_ABC")
                  .Match("test_")
                  .Match("abc")
                  .IgnoreCaseForAllExisting()
                  .Check();

Assert.True(matcher.Success);

ExtractAs - read description of ThenExtract()


ThenCustom - You can provide your own implementation of IRequirement interface.


FAQ

  • Should I use this on prod? Probably not (yet)

  • Why does it use old C#? I wanted it to work even on old .NET

To Do List

Experimental Features

It is possible to generate pure Regex from Matcher itself, but it will probably not support all kind of requirements because SomethingOfLenghtRequirement receives Func<char, CheckOptions, bool> which would be pretty diffcult to convert into Regex.

var text = "APPLE1";

var matcher = new Matcher()
                    .MatchAnyOf("Apple", "Watermelon").IgnoreCase();

var generator = new RegexGenerator(matcher.GetRequirements.ToList());

var result = generator.Emit();

Assert.True(result.Success);
Assert.False(string.IsNullOrWhiteSpace(result.Code));

// ("Apple", "Watermelon") are ordered by length.
Assert.Equal("(?i)(Watermelon|Apple)", result.Code);

var regex = new Regex(result.Code);

Assert.True(matcher.Check(text).Success);
Assert.Matches(regex, text);
var matcher = new Matcher()
                .Match("[2021-09-05] ")
                .ThenAnyOf("WARNING:", "ERROR:");

var generator = new RegexGenerator(matcher.GetRequirements.ToList());
var result = generator.Emit();
Console.WriteLine(result.Code); // ([2021-09-05] )(WARNING:|ERROR:)
var matcher = new Matcher()
                .MatchAnyOf("Apple", "Banana")
                .Then(" ")
                .ThenAnyOf("Giraffe", "melon", "Watermelon");

var generator = new RegexGenerator(matcher.GetRequirements.ToList());
var result = generator.Emit();
Console.WriteLine(result.Code); // (Banana|Apple)( )(Watermelon|Giraffe|melon)

Source code: https://github.com/xd-loler/YetAnotherStringMatcher/tree/master/src/RegexCodeGeneration

Tests: https://github.com/xd-loler/YetAnotherStringMatcher/blob/master/src/Tests/RegexCodeGenTests.cs

Some Naive Benchmarks

Source Code: https://github.com/xd-loler/YetAnotherStringMatcher/tree/master/src/Benchmarking

Results

.NET SDK=6.0.100-preview.5.21302.13
  [Host]     : .NET 5.0.9 (5.0.921.35908), X64 RyuJIT
  DefaultJob : .NET 5.0.9 (5.0.921.35908), X64 RyuJIT

Compiled:

Method text Mean Error StdDev Gen 0 Allocated
Case1_YASM Apple 73.05 ns 1.140 ns 1.067 ns 0.0381 160 B
Case1_Regex Apple 53.99 ns 0.603 ns 0.564 ns - -
Case1_YASM AppleWatermelon 75.12 ns 0.572 ns 0.478 ns 0.0381 160 B
Case1_Regex AppleWatermelon 53.64 ns 0.395 ns 0.369 ns - -
Case1_YASM Watermelon 311.46 ns 1.126 ns 0.879 ns 0.0877 368 B
Case1_Regex Watermelon 46.79 ns 0.386 ns 0.342 ns - -
Method text Mean Error StdDev Gen 0 Allocated
Case2_YASM 12312(...)Apple [28] 1,029.61 ns 20.388 ns 21.815 ns 0.2575 1,080 B
Case2_Regex 12312(...)Apple [28] 46.84 ns 0.394 ns 0.308 ns - -
Case2_YASM qqqqq(...)melon [30] 594.30 ns 7.961 ns 7.447 ns 0.1812 760 B
Case2_Regex qqqqq(...)melon [30] 47.57 ns 0.567 ns 0.502 ns - -
Case2_YASM xcvxc(...)melon [44] 1,728.00 ns 16.347 ns 15.291 ns 0.5093 2,136 B
Case2_Regex xcvxc(...)melon [44] 47.03 ns 0.280 ns 0.262 ns - -
Method text Mean Error StdDev Gen 0 Allocated
Case3_YASM 12312(...)pple3 [29] 1,065.93 ns 15.903 ns 14.875 ns 0.3033 1,272 B
Case3_Regex 12312(...)pple3 [29] 63.71 ns 0.648 ns 0.606 ns - -
Case3_YASM qqqqq(...)melon [32] 720.06 ns 10.396 ns 9.724 ns 0.2270 952 B
Case3_Regex qqqqq(...)melon [32] 59.86 ns 0.697 ns 0.652 ns - -
Case3_YASM xcvxc(...)melon [44] 1,852.92 ns 21.209 ns 19.839 ns 0.5093 2,136 B
Case3_Regex xcvxc(...)melon [44] 39.03 ns 0.494 ns 0.438 ns - -
Method text Mean Error StdDev Gen 0 Allocated
Case4_YASM Lette(...)ondon [22] 348.59 ns 3.678 ns 3.260 ns 0.1106 464 B
Case4_Regex Lette(...)ondon [22] 65.44 ns 0.722 ns 0.675 ns - -
Case4_YASM Parce(...)ondon [22] 547.35 ns 8.901 ns 7.891 ns 0.1431 600 B
Case4_Regex Parce(...)ondon [22] 231.41 ns 1.582 ns 1.479 ns - -
Case4_YASM ParcelLondon 264.08 ns 3.405 ns 2.658 ns 0.1049 440 B
Case4_Regex ParcelLondon 85.63 ns 0.639 ns 0.566 ns - -

Not Compiled:

Method text Mean Error StdDev Gen 0 Allocated
Case1_YASM Apple 761.3 ns 10.47 ns 9.28 ns 0.2537 1 KB
Case1_Regex Apple 1,928.2 ns 32.31 ns 35.91 ns 0.6485 3 KB
Case1_YASM AppleWatermelon 744.4 ns 14.63 ns 21.45 ns 0.2537 1 KB
Case1_Regex AppleWatermelon 1,868.8 ns 8.65 ns 7.67 ns 0.6485 3 KB
Case1_YASM Watermelon 1,049.0 ns 9.55 ns 8.94 ns 0.3033 1 KB
Case1_Regex Watermelon 1,698.6 ns 12.28 ns 11.49 ns 0.4978 2 KB
Method text Mean Error StdDev Gen 0 Allocated
Case2_YASM 12312(...)Apple [28] 1.915 μs 0.0222 μs 0.0208 μs 0.5035 2 KB
Case2_Regex 12312(...)Apple [28] 1.483 μs 0.0122 μs 0.0108 μs 0.6046 2 KB
Case2_YASM qqqqq(...)melon [30] 1.494 μs 0.0141 μs 0.0132 μs 0.4292 2 KB
Case2_Regex qqqqq(...)melon [30] 1.533 μs 0.0100 μs 0.0094 μs 0.6046 2 KB
Case2_YASM xcvxc(...)melon [44] 2.629 μs 0.0361 μs 0.0320 μs 0.7591 3 KB
Case2_Regex xcvxc(...)melon [44] 1.307 μs 0.0125 μs 0.0111 μs 0.4539 2 KB
Method text Mean Error StdDev Gen 0 Allocated
Case3_YASM 12312(...)pple3 [29] 2.254 μs 0.0211 μs 0.0198 μs 0.5722 2 KB
Case3_Regex 12312(...)pple3 [29] 2.032 μs 0.0126 μs 0.0106 μs 0.6676 3 KB
Case3_YASM qqqqq(...)melon [32] 1.831 μs 0.0324 μs 0.0287 μs 0.4978 2 KB
Case3_Regex qqqqq(...)melon [32] 2.013 μs 0.0401 μs 0.0478 μs 0.6676 3 KB
Case3_YASM xcvxc(...)melon [44] 2.770 μs 0.0169 μs 0.0132 μs 0.7820 3 KB
Case3_Regex xcvxc(...)melon [44] 1.813 μs 0.0106 μs 0.0099 μs 0.5035 2 KB
Method text Mean Error StdDev Gen 0 Allocated
Case4_YASM Lette(...)ondon [22] 2.098 μs 0.0122 μs 0.0102 μs 0.5493 2 KB
Case4_Regex Lette(...)ondon [22] 5.163 μs 0.0573 μs 0.0536 μs 1.3046 5 KB
Case4_YASM Parce(...)ondon [22] 2.320 μs 0.0149 μs 0.0124 μs 0.5836 2 KB
Case4_Regex Parce(...)ondon [22] 6.136 μs 0.0673 μs 0.0629 μs 1.3199 5 KB
Case4_YASM ParcelLondon 2.013 μs 0.0193 μs 0.0181 μs 0.5455 2 KB
Case4_Regex ParcelLondon 5.068 μs 0.0535 μs 0.0474 μs 1.3199 5 KB

Benchmark FAQ

  • How to run Benchmark on my own computer?

Run this command inside \Benchmarking folder

dotnet run -c Release
Product Compatible and additional computed target framework versions.
.NET net5.0 was computed.  net5.0-windows was computed.  net6.0 was computed.  net6.0-android was computed.  net6.0-ios was computed.  net6.0-maccatalyst was computed.  net6.0-macos was computed.  net6.0-tvos was computed.  net6.0-windows was computed.  net7.0 was computed.  net7.0-android was computed.  net7.0-ios was computed.  net7.0-maccatalyst was computed.  net7.0-macos was computed.  net7.0-tvos was computed.  net7.0-windows was computed.  net8.0 was computed.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed.  net9.0 was computed.  net9.0-android was computed.  net9.0-browser was computed.  net9.0-ios was computed.  net9.0-maccatalyst was computed.  net9.0-macos was computed.  net9.0-tvos was computed.  net9.0-windows was computed. 
.NET Core netcoreapp1.0 was computed.  netcoreapp1.1 was computed.  netcoreapp2.0 was computed.  netcoreapp2.1 was computed.  netcoreapp2.2 was computed.  netcoreapp3.0 was computed.  netcoreapp3.1 was computed. 
.NET Standard netstandard1.0 is compatible.  netstandard1.1 was computed.  netstandard1.2 was computed.  netstandard1.3 was computed.  netstandard1.4 was computed.  netstandard1.5 was computed.  netstandard1.6 was computed.  netstandard2.0 was computed.  netstandard2.1 was computed. 
.NET Framework net35 is compatible.  net40 was computed.  net403 was computed.  net45 was computed.  net451 was computed.  net452 was computed.  net46 was computed.  net461 was computed.  net462 was computed.  net463 was computed.  net47 was computed.  net471 was computed.  net472 was computed.  net48 was computed.  net481 was computed. 
MonoAndroid monoandroid was computed. 
MonoMac monomac was computed. 
MonoTouch monotouch was computed. 
Tizen tizen30 was computed.  tizen40 was computed.  tizen60 was computed. 
Universal Windows Platform uap was computed.  uap10.0 was computed. 
Windows Phone wp8 was computed.  wp81 was computed.  wpa81 was computed. 
Windows Store netcore was computed.  netcore45 was computed.  netcore451 was computed. 
Xamarin.iOS xamarinios was computed. 
Xamarin.Mac xamarinmac was computed. 
Xamarin.TVOS xamarintvos was computed. 
Xamarin.WatchOS xamarinwatchos was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
1.0.3 376 12/5/2021
1.0.2 316 9/5/2021