Thursday, June 19

Reddit

Category Added in a WPeMatico Campaign

We built a data-free method for compressing heavy LLMs
News Feed, Reddit

We built a data-free method for compressing heavy LLMs

Hey folks! I’ve been working with the team at Yandex Research on a way to make LLMs easier to run locally, without calibration data, GPU farms, or cloud setups. We just published a paper on HIGGS, a data-free quantization method that skips calibration entirely. No datasets or activations required. It’s meant to help teams compress and deploy big models like DeepSeek-R1 or Llama 4 Maverick on laptops or even mobile devices. The core idea comes from a theoretical link between per-layer reconstruction error and overall perplexity. This lets us: -Quantize models without touching the original data -Get decent performance at 3–4 bits per parameter -Cut inference costs and make LLMs more practical for edge use We’ve been using HIGGS internally for fast iteration and testing, and it's proven highl...
Sam Altman tacitly admits AGI isnt coming
News Feed, Reddit

Sam Altman tacitly admits AGI isnt coming

Sam Altman recently stated that OpenAI is no longer constrained by compute but now faces a much steeper challenge: improving data efficiency by a factor of 100,000. This marks a quiet admission that simply scaling up compute is no longer the path to AGI. Despite massive investments in data centers, more hardware won’t solve the core problem — today’s models are remarkably inefficient learners. We've essentially run out of high-quality, human-generated data, and attempts to substitute it with synthetic data have hit diminishing returns. These models can’t meaningfully improve by training on reflections of themselves. The brute-force era of AI may be drawing to a close, not because we lack power, but because we lack truly novel and effective ways to teach machines to think. This shift in und...
The AI Report