For small scripts I've found the output to be very similar between small local models and GPT-4o (judging by a human eye).
For small scripts I've found the output to be very similar between small local models and GPT-4o (judging by a human eye).