Skip to content
View shreyansh26's full-sized avatar
👨‍🎓
Always Learning
👨‍🎓
Always Learning

Organizations

@COPS-IITBHU

Block or report shreyansh26

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shreyansh26/README.md

👋 Hi there!

I’m the Lead ML Engineer at Level AI, where I focus on building and scaling large language models (LLMs) specifically for conversational AI. With over four years of experience in applied AI and research, I’ve worked extensively on end-to-end solutions in NLP and ML systems.

Before Level AI, I worked as a Data Scientist at Mastercard AI Garage, where I developed AI models to enhance transaction security and intelligence. I graduated in 2020 with a degree in Computer Science from the Indian Institute of Technology (BHU) Varanasi.

My technical interests include Natural Language Processing, ML Systems Engineering—including CUDA and Triton for high-performance computing, Privacy-preserving ML, and Cryptography.

I’m always working on side projects, many of which involve implementing and experimenting with ideas from research papers, efficient kernelsand other advanced techniques in these areas. You can find these projects here.

📫 How to reach me

📕 Latest Blog Posts

Pinned Loading

  1. Annotated-ML-Papers Annotated-ML-Papers Public

    Annotations of the interesting ML papers I read

    213 18

  2. FlashAttention-PyTorch FlashAttention-PyTorch Public

    Implementation of FlashAttention in PyTorch

    Python 122 16

  3. Extracting-Training-Data-from-Large-Langauge-Models Extracting-Training-Data-from-Large-Langauge-Models Public

    A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020

    Python 33 5

  4. Speculative-Sampling Speculative-Sampling Public

    Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind

    Python 79 9

  5. DeepLearning-in-the-Browser DeepLearning-in-the-Browser Public

    Deploy Deep Learning models directly in the browser. Includes code for deploying using Tensorflow.js, WebDNN, and ONNX.js.

    JavaScript 9 4

  6. Linux-Malware-Detection-Research Linux-Malware-Detection-Research Public

    A collection of Linux Malware Detection projects (research paper implementations) done by me.

    Jupyter Notebook 9 3