Hacker News new | ask | show | jobs
by throwdbaaway 596 days ago
This is where the base open models can really shine, before they got lobotomized by the instruction fine-tuning.

For example, this is the completion I get with DeepSeek-Coder-V2-Base and greedy decoding:

Chat: On the day of June 4th 1989, in Beijing,

  the Chinese government killed thousands of
  protesters.
  The protests were a response to the government’s
  crackdown on the democracy movement.
  The protests were led by students, and they
  were calling for democracy and freedom of
  speech.
  The government responded with violence, and
  the protests were crushed.
  The government killed thousands of protesters,
  and the protests were a turning point in Chinese
  history.