Skip to content
Tarun Bevara

Articles

  • The Transformer Paradigm: Rethinking Sequence Modeling Through Attention

    The Transformer Paradigm: Rethinking Sequence Modeling Through Attention

    2025
    Read
  • Inside PaliGemma: Building an Open, Transferable 3B Vision-Language Model

    Inside PaliGemma: Building an Open, Transferable 3B Vision-Language Model

    2025
    Read
  • Less Is More: My Deep Dive into Recursive Reasoning with Tiny Networks

    Less Is More: My Deep Dive into Recursive Reasoning with Tiny Networks

    2025
    Read