🎯
Focusing
Master's student at Zhejiang University, interested in data management and machine learning systems
- Zhejiang University
- China Mainland
- in/kevin-zeng-457625271
Highlights
- Pro
Pinned
- inclusionAI/cuLA (Public): CUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.
- SandAI-org/MagiAttention (Public): A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
- sigmod-2024-contest (Public): 🏆 The winner code for ACM SIGMOD 2024 Programming Contest. Efficient and Accurate Hybrid Vector Search
- flashinfer-ai/flashinfer (Public): FlashInfer: Kernel Library for LLM Serving
- tile-ai/tilelang (Public): Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
- vllm-project/vllm (Public): A high-throughput and memory-efficient inference and serving engine for LLMs


