Follow us on RSS

Ben Dickson is a software engineer and the founder of TechTalks, a blog that explores the ways technology is solving and creating problems. He writes about technology, business and politics. Follow him on Twitter: @BenDee983.

DeepMind makes big jump toward interpreting LLMs with sparse autoencoders

DeepMind makes big jump toward interpreting LLMs with sparse autoencoders

Ben Dickson July 26, 2024 8:04 AM

Gen AI boosts individual creativity at the cost of collective diversity, study finds

Gen AI boosts individual creativity at the cost of collective diversity, study finds

Ben Dickson July 23, 2024 1:56 PM

Researchers develop technique to give robots “embodied reasoning” abilities

Researchers develop technique to give robots “embodied reasoning” abilities

Ben Dickson July 19, 2024 1:42 PM

FlashAttention-3 unleashes the power of H100 GPUs for LLMs

FlashAttention-3 unleashes the power of H100 GPUs for LLMs

Ben Dickson July 15, 2024 2:40 PM

Meta researchers distill System 2 thinking into LLMs, improving performance on complex reasoning

Meta researchers distill System 2 thinking into LLMs, improving performance on complex reasoning

Ben Dickson July 12, 2024 9:20 PM

DeepMind’s PEER scales language models with millions of tiny experts

DeepMind’s PEER scales language models with millions of tiny experts

Ben Dickson July 12, 2024 8:53 PM

Enterprises embrace generative AI, but challenges remain

Enterprises embrace generative AI, but challenges remain

Ben Dickson July 9, 2024 4:10 PM

AI agent benchmarks are misleading, study warns

AI agent benchmarks are misleading, study warns

Ben Dickson July 6, 2024 9:37 AM

How AI Agents are changing software development

How AI Agents are changing software development

Ben Dickson July 4, 2024 5:00 AM

Alter3 is the latest GPT-4-powered humanoid robot

Alter3 is the latest GPT-4-powered humanoid robot

Ben Dickson June 24, 2024 2:34 PM

How Gradient created an open LLM with a million-token context window

How Gradient created an open LLM with a million-token context window

Ben Dickson June 24, 2024 12:47 PM

OpenVLA is an open-source generalist robotics model

OpenVLA is an open-source generalist robotics model

Ben Dickson June 18, 2024 5:50 AM

New Transformer architecture could enable powerful LLMs without GPUs

New Transformer architecture could enable powerful LLMs without GPUs

Ben Dickson June 13, 2024 9:52 AM

What we know about Apple’s on-device AI

What we know about Apple’s on-device AI

Ben Dickson June 11, 2024 2:07 PM

Stanford study finds AI legal research tools prone to hallucinations

Stanford study finds AI legal research tools prone to hallucinations

Ben Dickson June 7, 2024 1:24 PM

How foundation agents can revolutionize AI decision-making in the real world

How foundation agents can revolutionize AI decision-making in the real world

Ben Dickson June 4, 2024 4:06 PM

Meta and Google researchers’ new data curation method could transform self-supervised learning

Meta and Google researchers’ new data curation method could transform self-supervised learning

Ben Dickson May 31, 2024 7:57 AM

Microsoft, Beihang release MoRA, an efficient LLM fine-tuning technique

Microsoft, Beihang release MoRA, an efficient LLM fine-tuning technique

Ben Dickson May 28, 2024 10:11 AM

Microsoft’s vs. Apple’s AI computer strategies: Why Satya is winning (for now)

Microsoft’s vs. Apple’s AI computer strategies: Why Satya is winning (for now)

Ben Dickson May 23, 2024 7:24 AM

Meta introduces Chameleon, a state-of-the-art multimodal model

Meta introduces Chameleon, a state-of-the-art multimodal model

Ben Dickson May 21, 2024 6:35 PM

How attention offloading reduces the costs of LLM inference at scale

How attention offloading reduces the costs of LLM inference at scale

Ben Dickson May 14, 2024 1:50 PM

Nvidia’s DrEureka outperforms humans in training robotics systems

Nvidia’s DrEureka outperforms humans in training robotics systems

Ben Dickson May 6, 2024 12:53 PM

Meta’s new multi-token prediction makes AI models up to 3X faster

Meta’s new multi-token prediction makes AI models up to 3X faster

Ben Dickson May 6, 2024 9:34 AM

DeepMind researchers discover impressive learning capabilities in long-context LLMs

DeepMind researchers discover impressive learning capabilities in long-context LLMs

Ben Dickson April 24, 2024 12:46 PM

Meta challenges transformer architecture with Megalodon LLM

Meta challenges transformer architecture with Megalodon LLM

Ben Dickson April 18, 2024 12:48 PM

How LLMs are ushering in a new era of robotics

How LLMs are ushering in a new era of robotics

Ben Dickson April 16, 2024 5:04 PM

Google’s new technique gives LLMs infinite context

Google’s new technique gives LLMs infinite context

Ben Dickson April 12, 2024 2:16 PM

Sakana AI’s evolutionary algorithm discovers new architectures for generative models

Sakana AI’s evolutionary algorithm discovers new architectures for generative models

Ben Dickson March 26, 2024 8:26 AM

DeepMind and Stanford’s new robot control model follow instructions from sketches

DeepMind and Stanford’s new robot control model follow instructions from sketches

Ben Dickson March 11, 2024 1:41 PM

State Dept-backed report provides action plan to avoid catastrophic AI risks

State Dept-backed report provides action plan to avoid catastrophic AI risks

Ben Dickson March 11, 2024 11:33 AM

Why Meta’s V-JEPA model can be a big deal for real-world AI

Why Meta’s V-JEPA model can be a big deal for real-world AI

Ben Dickson February 28, 2024 6:15 AM

What is Apple’s generative AI strategy?

What is Apple’s generative AI strategy?

Ben Dickson February 15, 2024 6:00 AM

DeepMind’s GenEM uses LLMs to generate expressive behaviors for robots

DeepMind’s GenEM uses LLMs to generate expressive behaviors for robots

Ben Dickson February 6, 2024 6:00 AM

Meta’s OK-Robot performs zero-shot pick-and-drop in unseen environments

Meta’s OK-Robot performs zero-shot pick-and-drop in unseen environments

Ben Dickson January 29, 2024 2:42 PM

Beyond chatbots: The wide world of embeddings

Beyond chatbots: The wide world of embeddings

Ben Dickson January 18, 2024 12:23 PM

Stanford’s mobile ALOHA robot learns from humans to cook, clean, do laundry

Stanford’s mobile ALOHA robot learns from humans to cook, clean, do laundry

Ben Dickson January 5, 2024 10:45 AM

2023 was a great year for open-source LLMs

2023 was a great year for open-source LLMs

Ben Dickson December 26, 2023 12:16 PM

UC Berkeley’s transformer-based robot control system generalizes to unseen environments

UC Berkeley’s transformer-based robot control system generalizes to unseen environments

Ben Dickson December 18, 2023 12:04 PM

New reinforcement learning method uses human cues to correct its mistakes

New reinforcement learning method uses human cues to correct its mistakes

Ben Dickson December 5, 2023 2:15 PM

New transformer architecture can make language models faster and resource-efficient

New transformer architecture can make language models faster and resource-efficient

Ben Dickson December 1, 2023 12:41 PM