Many of my mutus who were early believers (2023) in structured outputs found jobs in really cool companies and I am absolutely not surprised. Kings and queens.
Many of my mutus who were early believers (2023) in structured outputs found jobs in really cool companies and I am absolutely not surprised. Kings and queens.
I didn't expect DeepSeek v4 PRO (not Flash) to run well on the Mac Studio M3 Ultra with 512GB of RAM. This is 2 bit quantized with the same DwarfStar recipe used for Flash. 433GB GGUF file. 130 t/s prefill, 13 t/s generation. Prefill in the video is low because small prompt.
Under the Volcano is criminally underrated. It is the most beautiful book about self-destruction ever written. Mezcal, heat pressing down, the main character drifting. I’ve carried it around for almost two decades, and just started re-reading it. Recommend.
Writing a rebuttal to the structured-outputs-degrade-quality papers. The issues are embarrassingly basic, but unfortunately the citations keep coming. So here we go.
Following @julien_c’s tweet I bought a MacBook Pro with 128B unified memory, and started running Qwen3.6 as my daily driver. It’s baffling.
Apple outsmarted everyone in this industry.