|
|
|
|
|
by gen220
314 days ago
|
|
Disclosure: my current employer has a product in this space (graphite.dev) IME the highest value (at the moment) is having an LLM integrated into the PR page, that reads your code + CI log, and effectively operates as a sanity check / semantic linter. A common workflow for us: is Draft PR -> Passes CI (inclusive of an LLM 'review') -> Published -> Passes Human review -> Scheduled to merge The goal is to get a higher margin of confidence that your code (1) will not blow up in production (2) faithfully does what it's trying to do. The value of the LLM reviewer is maybe 80% in the first bucket and 20% in the second bucket, IME. It often catches bugs like "off by one" and "you meant this to be `if not x`, based on the flag name and behavior, not `if x`". |
|