Wokzy

Yegor Wokzy

DLOps engineer

blum-bot Public

Blum autoclicker in Python

Python, 62 stars, 20 forks

NVIDIA/TensorRT-LLM Public

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python, 13.4k stars, 2.3k forks

yandex-research/context-intensive-kv-offloading Public

[Work in Progress] Supplementary code for "KV Cache Offloading for Context-Intensive Tasks"

2