Skip to content(if available)orjump to list(if available)

ONNX Runtime and CoreML May Silently Convert Your Model to FP16

DiabloD3

This is why I laugh at so called "AI researchers". They build "quality software" like this, while everyone else stops fucking around and uses ggml and llama.cpp and doesn't have these weird issues.

omneity

Not until it gets tensor parallelism.

ipython

Eh, those “ai researchers” are too busy rolling around in mounds of freshly minted Benjamins to care about “quality software”