2024-09-02

Logs

Benchmarking >80 LLMs shows: The best model is not necessarily the best for your programming language 😱

- Best overall: Anthropic’s Sonnet 3.5
- Best for Go: Meta’s Llama 3.1 405B
- Best for Java: OpenAI’s GPT-4 Turbo
- Best for Ruby: OpenAI’s GPT-4o

Good models for one… pic.twitter.com/EYUphEI5rH
— Markus Zimmermann (@zimmskal) September 2, 2024

Great to see more concrete results published on how different models are “the best” at writing different programming languages.

Iterating on Cogno, improving the “remaining guesses” and sharing functionality.

Shipped a very-beta version of a recipe ingredient extraction app, which can then generate a grocery list from many recipes. Calling it snapcart.

Having used fasthtml and sqlite of a couple projects now, I was interested in trying to run an app as a Cloudflare Worker to make use of D1, which is a serverless, sqlite compatible database. It looks like the library needs to be added to the worker runtime first, according to this issue.

✎ Edit

Raw