Your comments

Yeah, It is SharpDX based, no other deps, but actually a lot of tweaks were made within old code, to reduce the amount of calculations, mostly while going from screen coords to axis coords and vice versa. As a result my fork will be faster even without GPU acceleration. Now I'm working on using single-precision calculations where it is reasonable. Memory footprint became a bit larger and adding points. Gimme a week and I'll fork :)