It’s much easier to test Temporal Workflow in Python by invoking the contents of the individual Activities first, in the shell or via a separate script, then composing them into a Workflow. I need to see if there’s a better way to surface exceptions and failures through Temporal directly to make the feedback loop faster.
From this paper:
62% of the generated code contains API misuses, which would cause unexpected consequences if the code is introduced into real-world software
This work further reinforces some recent thoughts on the importance of measuring the quality of a language model’s output for a use case.
From my experience at a big tech co, 30%+ of engineers there are working on shipping a few buttons. I once saw a team of 20 work for 6 months to ship a back button for the onboarding flow (surprisingly hard to do). https://t.co/1wFMQ7SLFm
— Flo Crivello (@Altimor) August 26, 2023
It had been a while since I thought about this 😬