Sitemap

A list of all the posts and pages found on the site. For the robots out there, an XML version is available for digesting as well.

Pages

Posts

Paged Attention and vLLM

6 minute read

Paged Attention is the memory-management optimization on which the vLLM inference engine is built. Here is a summary of the PagedAttention paper and the key features of vLLM that make it so powerful.

Are Autoencoders Fundamentally Denoisers?

6 minute read

The core idea behind autoencoders is to bottleneck information flow so that the network is forced to prioritize which information to propagate to the next layer (by restricting the number of dimensions in the latent space). In this project, I explore how this bottleneck can serve as a useful denoising tool.
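
As a rough illustration of the bottleneck idea (a minimal PyTorch sketch with assumed layer sizes and a toy denoising setup, not the project's actual code):

```python
import torch
import torch.nn as nn

class BottleneckAutoencoder(nn.Module):
    """Minimal autoencoder: the narrow latent layer forces the network
    to keep only the most salient structure of the input."""
    def __init__(self, input_dim=784, latent_dim=16):  # sizes are illustrative
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(input_dim, 128), nn.ReLU(),
            nn.Linear(128, latent_dim),          # the bottleneck
        )
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 128), nn.ReLU(),
            nn.Linear(128, input_dim),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

# Denoising setup: reconstruct the clean input from a noisy copy.
model = BottleneckAutoencoder()
x_clean = torch.rand(32, 784)                    # toy batch
x_noisy = x_clean + 0.1 * torch.randn_like(x_clean)
loss = nn.functional.mse_loss(model(x_noisy), x_clean)
```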

Implementing GPT from Scratch

9 minute read

This article is a conceptual explanation of building a language model from scratch using the decoder-only transformer architecture. It is based on Andrej Karpathy's GPT from scratch. The code for this conceptual guide can be found here.

Review: A Mathematical Framework for Transformer Circuits

7 minute read

This paper provides a mental model for reasoning about the internal workings of transformers and attention heads in deep neural networks. The insights help in understanding and analyzing the behavior of large models.

Portfolio

Predicting AI bias using SAEs

A comparative analysis of how Sparse Autoencoders and MLP activations encode gender information inside LLMs.

Beyond Lang

Voice calling with real-time speech translation and transcription.

Publications

Talks

Teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.