I’ve been following Jason’s work experimenting with different abstractions for constructing prompts and structuring responses. I’ve long felt that building prompts with strings is not the type of developer experience that will win the day. On the other hand, I’m wary of the wrong abstraction moving the developer too far away from the actual prompt, which would make it harder to construct good prompts and steer the model. I’m not sure if this is an ORM vs. SQL conversation or if there’s an abstraction that exists as a happy medium.
Did some work with Clojure destructuring.
Unpack values into specific variables.
user=> (let [[a b c] [1 2 3]] (println a b c))
1 2 3
nil
Unpack the first N items, ignoring the rest.
user=> (let [[a b] [1 2 3]] (println a b))
1 2
nil
Unpack the first N items to variables and capture the rest as a seq.
user=> (let [[a b & rst] [1 2 3 4 5]] (println a b rst))
1 2 (3 4 5)
nil
Doing math that mixes a double with a BigDecimal casts the result down to a double, losing the BigDecimal’s exactness; converting back with bigdec afterward doesn’t recover it.
user=> (* 0.1 101M)
10.100000000000001
user=> (bigdec (* 0.1 101M))
10.100000000000001M
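Keeping both operands as BigDecimal literals should avoid the downcast entirely, since BigDecimal math stays exact:
user=> (* 0.1M 101M)
10.1M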
Heard the phrase “if someone wins the lottery” used today to describe a teammate leaving a team. I much prefer this to the more morbid alternatives.
I tried gpt-engineer today. I liked the approach and the setup instructions are good. I think I remember needing to use Python 3.11 instead of the 3.8 I was running, but beyond that the readme instructions were on point.
Process
You start by creating a project folder with a plaintext prompt. You start the script and point it at your project folder. The program reads your prompt, then uses the LLM to ask clarifying questions. The clarifying questions seem pretty effective. If you answer more than one of the predetermined questions at once, the program seems to recognize that and removes the already-answered ones from the list. Finally, it creates an actual project, with source code, pretty consistently (3/3 times I tried). I used it to try to create a one-player Scattergories CLI game, or something close to it.
I’ve been thinking about the concept of “prompt overfitting”. In this context, there is a distinction between model overfitting and prompt overfitting. Say you want to use a large language model as a classifier. You may give it several example inputs and the expected outputs. I don’t have hard data to go by, but it feels meaningful to keep the prompt generic or abstract where possible rather than enumerating overly specific cases in a way that obfuscates the broader pattern you’re hoping to apply. I hypothesize these overly specific examples could interfere with the model output in unintended, overly restrictive ways.
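As a made-up illustration (the task, examples, and names here are hypothetical, not something I tested), compare two few-shot prompts for a sentiment classifier. The first states the pattern with a couple of generic examples; the second enumerates near-duplicate, overly specific cases that could anchor the model on surface details rather than the underlying pattern:

;; Hypothetical prompts, for illustration only.
(def generic-prompt
  (str "Classify the sentiment of the text as positive, negative, or neutral.\n"
       "Text: \"The food was great.\" -> positive\n"
       "Text: \"I waited an hour and left.\" -> negative\n"
       "Text: "))

;; The "overfit" variant repeats near-duplicate, overly specific cases.
(def overfit-prompt
  (str "Classify the sentiment of the text as positive, negative, or neutral.\n"
       "Text: \"The pad thai at Thai Palace was great.\" -> positive\n"
       "Text: \"The pad thai at Thai Palace was cold.\" -> negative\n"
       "Text: \"The green curry at Thai Palace was great.\" -> positive\n"
       "Text: \"The green curry at Thai Palace was cold.\" -> negative\n"
       "Text: "))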
Richard WM Jones booted Linux 292,612 times to find a bug where it hangs on boot. I loved reading the recounting of his process to accomplish this: bisecting through the different versions of Linux and booting each thousands of times to determine whether that version contained the bug.
Georgi Gerganov started a company, ggml.ai, to run language models on commodity hardware.
I was listening to a podcast interview of Adam Robinson and he was discussing why he believes it’s important to process information with your body. He gives the example that when listening to something, he stops around once every minute and sees how it feels. He later goes on to highlight the importance of recognizing when something is “weird” and paying attention to it, or trusting one’s intuition even if rationally we can’t understand why something feels unusual. He asserts that we should take action on these intuitions – “if something seems a little bit off, it’s very off”. I could see this approach contributing to more well-rounded cognition, since it can be easy to discard these “intuitions” as unsubstantiated or lacking facts to justify them. On the other hand, unless you track how often your intuition proves correct, it can be hard to know whether it’s well calibrated or trustworthy in the area where you’re applying it.
I read an article today about Scripting with Elixir that caught my eye because it touches on a problem I often struggle with: how do you easily distribute a script along with its dependencies in Python?
Elixir has an example of what this could look like in the form of Mix.install. This feature allows you to distribute just the source code of your script; dependency management happens when the script runs, without needing to ship a mix.exs or requirements.txt file (as one does in Python) along with the source.
Today, I played around with Matt Rickard’s ReLLM library, another take on constraining LLM output, in this case with regex. I tried to use it to steer a language model to generate structure (JSON) from unstructured input. This exercise is sort of like parsing or validating JSON with regex – it’s not a good idea. Complicated regular expressions to describe JSON are hard to write. I do love the demo that generates acronyms though. Matt also wrote parserLLM, which provides the ability to use a context-free grammar to constrain the next predicted token from the language model. I prefer the context-free grammar approach at a high level, but believe we need the language model constraints to be directly connected with the data structures we intend to use in code to effectively weave language models into our existing applications.
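ReLLM is a Python library, so this isn’t its actual API – just a minimal Clojure sketch of the core idea as I understand it: at each decoding step, only keep candidate tokens whose addition leaves the generated text a viable prefix of the target regex. On the JVM, Matcher’s hitEnd can stand in for partial matching; in a real loop the candidates would come from the model’s next-token distribution, and you’d repeat until the pattern fully matches.

;; Sketch of regex-constrained decoding; the model itself is out of scope here.
(defn viable-prefix?
  "True if s already matches pattern, or could still be extended to match it."
  [^java.util.regex.Pattern pattern ^String s]
  (let [m (.matcher pattern s)]
    (or (.matches m) (.hitEnd m))))

(defn allowed-tokens
  "Keep only the candidate next tokens that preserve a viable prefix of pattern."
  [pattern generated candidates]
  (filter #(viable-prefix? pattern (str generated %)) candidates))

(comment
  ;; Constrain output to a quoted lowercase key, a colon, and a number.
  ;; The third candidate can no longer lead to a match, so it is dropped.
  (let [pattern (re-pattern "\"[a-z]+\": \\d+")]
    (allowed-tokens pattern "\"age" ["\": " "s\": " " years"])))
;; => ("\": " "s\": ")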
I’ve been trying to find a simple way to host a website that formats and serves a nice looking version of a recipe written in markdown.
There are a few open source projects available, but nothing that has fit the bill yet.
I briefly had the idea to try out something with Next.js and mdx, but I found when I scaffolded a new app that I didn’t even recognize the project structure.
Next.js has moved to the “App Router” approach for structuring projects.
It’s not immediately obvious or intuitive how this works.
Since Next.js is a batteries-included framework, it makes sense that each approach to project structure comes with its own learning curve.
Nevertheless, it’s a bit irritating how frequently this structure changes.
I have a ~2-3 year old Next.js project that looks nothing like the project I am currently working on.
And the project I am currently working on looks very different from a newly scaffolded project with App Router today.