Skip to content
View RayWang96's full-sized avatar

Highlights

  • Pro

Block or report RayWang96

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. GPU_KNNG GPU_KNNG Public

    Source code of CIKM 2021 paper: Fast k-NN Graph Construction by GPU based NN-Descent.

    Cuda 9 1

  2. DeepGEMM DeepGEMM Public

    Forked from deepseek-ai/DeepGEMM

    DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

    Python 1 1

  3. HugeCTR HugeCTR Public

    Forked from NVIDIA-Merlin/HugeCTR

    HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

    C++

  4. raft raft Public

    Forked from rapidsai/raft

    RAFT contains fundamental widely-used algorithms and primitives for data science, graph and machine learning.

    Cuda

  5. flashinfer flashinfer Public

    Forked from flashinfer-ai/flashinfer

    FlashInfer: Kernel Library for LLM Serving

    Cuda