by swyx
999 days ago
my notes:
- ramped up to 16k BeMyEyes + 1k developer alpha testers over 6 months
- reduced frequency and severity of hallucinations
- improved OCR and quality of descriptions
- great demand for describing people without affecting privacy/bias
- intentionally refusing person identification 98% of the time and lowering accuracy to 0%. also declining a whole lot of problematic queries, per fig 8
- converting known jailbreaks to images to defend against multimodal jailbreaks. ironic how jailbreak collection websites probably made it a lot easier to break the jailbreaks
- interesting descriptions of mitigation process in 2.4.2

discussion linked https://twitter.com/swyx/status/1706359912283152556