
Organizations

@halide @llvm

asaadaldien/README.md

Hey there, it's Ahmed Taei! 👋

I'm a software engineer and applied mathematician, and the blend of both disciplines defines my work. I develop systems, algorithms, compilers, and languages for AI and numerical computing more broadly, so my work usually sits at the intersection of these areas.

About Me

  • 2025–present @ NVIDIA.
  • 2023–2025 @ Modular: worked on Mojo 🔥 on GPUs. Part of this work was presented as an LLVM talk: Watch here.
  • 📚 In my previous roles, I developed distributed ML training systems and algorithms, designed DSLs for ML kernels on custom silicon, and built compilers and runtime stacks from the ground up for ML accelerators. Part of this work involved contributions to open-source projects such as OpenXLA/IREE Compiler, PyTorch, TensorFlow, and Caffe2.

Connect with Me

LinkedIn
Twitter
Resume

Pinned

  1. iree-org/iree Public

    A retargetable MLIR-based machine learning compiler and runtime toolkit.

    C++ · 3.8k stars · 942 forks

  2. llvm/llvm-project Public

    The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

    LLVM · 39.1k stars · 17.7k forks

  3. llvm/torch-mlir Public

    The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

    C++ · 1.9k stars · 702 forks

  4. halide/Halide Public

    a language for fast, portable data-parallel computation

    C++ · 6.5k stars · 1.1k forks

  5. pytorch/pytorch Public

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python · 101k stars · 28.2k forks

  6. pytorch/glow Public archive

    Compiler for Neural Network hardware accelerators

    C++ · 3.3k stars · 702 forks