
Introduction
Scaling up machine learning models – in terms of model size, dataset size, and compute – has led to dramatic improvements in performance on language and vision tasks.

Motivation
Autonomous vehicles rely on different sensors (cameras, LiDAR, and radar), each offering a unique view of the world. Combining them is key to reliable perception, but it’s tricky because each modality has its own strengths, blind spots, and failure modes.

Robots in unstructured spaces, such as homes, have struggled to generalize across unpredictable settings. In this context, a few projects stand out from the rest.

Let’s talk about DINOv2, a paper that takes a major leap forward in the quest for general-purpose visual features in computer vision. Inspired by the success of large-scale self-supervised models in NLP, it asks whether similar pretraining at scale, on a large curated image dataset, can yield all-purpose visual features that work out of the box.

DeepSeek-V2 introduces a major architectural innovation that enhances its efficiency as a language model – Multi-head Latent Attention (MLA). MLA significantly reduces the memory the key-value (KV) cache consumes during inference by compressing keys and values into a compact latent vector that is cached in their place.
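
To make the idea concrete, here is a minimal PyTorch sketch of the low-rank KV compression at the heart of MLA. The class name, dimensions, and projection layout (`LatentKVAttention`, `d_latent`, and so on) are illustrative assumptions, not DeepSeek-V2's actual design, and the sketch omits details such as the decoupled rotary position embeddings; it is meant only to show why caching one small latent per token instead of full per-head keys and values saves memory.

```python
import torch
import torch.nn as nn


class LatentKVAttention(nn.Module):
    """Illustrative sketch of MLA-style low-rank KV compression.

    Instead of caching full per-head keys and values, the layer caches one
    small latent vector per token and up-projects it to K/V at attention
    time. Names and sizes are hypothetical, not DeepSeek-V2's.
    """

    def __init__(self, d_model: int = 512, n_heads: int = 8, d_latent: int = 64):
        super().__init__()
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        self.q_proj = nn.Linear(d_model, d_model)
        # Down-projection: hidden state -> compact latent (this is what gets cached)
        self.kv_down = nn.Linear(d_model, d_latent)
        # Up-projections: latent -> full keys and values, recomputed on the fly
        self.k_up = nn.Linear(d_latent, d_model)
        self.v_up = nn.Linear(d_latent, d_model)
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor, latent_cache: torch.Tensor | None = None):
        b, t, d = x.shape
        # Compress the new tokens and append them to the cached latents
        latent = self.kv_down(x)                      # (b, t, d_latent)
        if latent_cache is not None:
            latent = torch.cat([latent_cache, latent], dim=1)
        s = latent.shape[1]                           # total cached length

        q = self.q_proj(x).view(b, t, self.n_heads, self.d_head).transpose(1, 2)
        k = self.k_up(latent).view(b, s, self.n_heads, self.d_head).transpose(1, 2)
        v = self.v_up(latent).view(b, s, self.n_heads, self.d_head).transpose(1, 2)

        # Causal mask: a query at absolute position p attends to keys <= p
        pos_q = torch.arange(s - t, s, device=x.device).view(t, 1)
        pos_k = torch.arange(s, device=x.device).view(1, s)
        scores = q @ k.transpose(-2, -1) / self.d_head ** 0.5
        scores = scores.masked_fill(pos_k > pos_q, float("-inf"))

        out = (torch.softmax(scores, dim=-1) @ v).transpose(1, 2).reshape(b, t, d)
        return self.out_proj(out), latent             # latent is the new KV cache
```

With these toy sizes the cache holds 64 floats per token instead of the 2 × 512 = 1024 that a standard multi-head KV cache would store – a 16× reduction – at the cost of extra up-projection compute each step. DeepSeek-V2's actual recipe differs in its details, so treat this purely as the shape of the idea.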