Pinned Loading
-
open-r1
open-r1 PublicA RL framework that supports latest LLMs and VLMs (added new rl features and new verifiers, enhanced profiling)
Python
-
VLM-reward-hacking-detection
VLM-reward-hacking-detection PublicA framework to train soft tokens and a backbone VLM for detecting reward hacking in target VLMs.
Python
-
flash-attn-economical-gpu
flash-attn-economical-gpu PublicA set of efficient flash attention implementations based on Triton, beating or reaching comparable performance with Pytorch's cuDNN-backed SDPA on Ampere and Turing GPUs.
Python
-
recsys-retailrocket
recsys-retailrocket PublicRecSys model training framework on a challenging dataset (finished in 2 days with AI coding)
Python
-
-
FM
FM PublicAn efficient implementation of Factorization Machines that supports order=3 features in linear time.
Python
If the problem persists, check the GitHub status page or contact support.