by majeedkazemi
745 days ago
The same goes for the human TAs who are used extensively in undergrad introductory programming classes. They can also be unreliable in many cases.
1. Provide students with the tools and knowledge to critically verify responses, whether they come from an educator or an AI agent.
2. Build more transparent AI agents that show how reliable they are on different types of queries. Our deployment showed that the Help Fix Code feature was less reliable, while other features were significantly better.
But totally agree that we should be discussing the ethical implications much more.
I think one difference is that human TAs can, theoretically, be held accountable for their reliability, whereas holding an LLM accountable is a little more difficult.