Language Modelling - Search News

Open source Mamba 3 arrives to surpass Transformer architecture with nearly 4% improved language modeling, reduced latency

This release is good for developers building long-context applications, real-time reasoning agents, or those seeking to reduce GPU costs in high-volume production environments.

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

OpenAI, Mistral AI release new hardware-efficient language models

OpenAI Group PBC and Mistral AI SAS today introduced new artificial intelligence models optimized for cost-sensitive use cases. OpenAI is rolling out two algorithms called GPT-5.4 mini and GPT 5.4 ...

New Technology Brings Advanced Language Models to Everyday Devices

A Stanford engineer has demonstrated that frontier language models can run directly on everyday edge devices using convex ...

InfoQ

Google Researchers Propose Bayesian Teaching Method for Large Language Models

Google Research has proposed a training method that teaches large language models to approximate Bayesian reasoning by learning from the predictions of an optimal Bayesian system. The approach focuses ...

The Manila Times

Personal AI’s Memory-Based Small Language Models Deliver Hyper-Personalized Experiences on Comcast’s AI Grid, Powered by NVIDIA

Memory-based Small Language Models deployed across virtualized, highly distributed telecommunications networks achieve sub-500ms response times and up to 40x lower operating costs compared to LLMs on ...

Communications of the ACM

Measuring What Matters in Large Language Model Performance

Study reveals limitations of large language models in medical diagnostics

Yann LeCun Got $1 Billion For World Model AI. These Robots Learned 1,000 Real-World Tasks In 24 Hours

The emerging types of language models and why they matter