News

Open and run tokenizer_to_onnx_model.ipynb - this is the most important file in the repository. The notebook demonstrates how to convert a Hugging Face tokenizer to ONNX format ...
How to run vLLM on RTX PRO 6000 (and likely all other Blackwell cards in the RTX PRO and RTX 50xx series - CUDA 12.8) under WSL2 Ubuntu 24.04 on Windows 11 to play around with Mistral 24B 2501, 2503, and ...