by espadrine 1176 days ago
One flaw is that it tends to extend the question instead of answering it, even though the prompt asks for an answer.

For instance, I compare llama-7b-se-rl-peft to OA_SFT_Llama_30B[0] on the following question: “What is the color of the sky on Mars?”

llama-7b-se-rl-peft: “There are a few other related questions in this site, but none addresses exactly what I’m looking for. If you have been to mars (or if someone else has) can you please tell me what the colour of the sky is and maybe how it looks from different angles or at night time etc.”

OA_SFT_Llama_30B: “The color of the sky on Mars depends on various factors, such as time of day and year, atmospheric conditions, and viewing angle. However, in general, the Martian sky is often described as having a reddish or pink hue due to the presence of iron oxide in its soil and atmosphere.”

It could be the smaller size, or it could be the reward model not incentivizing a proper understanding of the “Answer:” tokens. Still, it is nice to see those open efforts.
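To make the “Answer:” point concrete, here is a minimal sketch of the kind of prompt wrapping I have in mind. The exact template is an assumption on my part (inferred only from the “Answer:” tokens mentioned above), and `build_prompt` and `split_completion` are hypothetical helper names, not part of either model's code.

```python
# Hypothetical template: the "Question:"/"Answer:" layout is an assumption,
# inferred only from the mention of the "Answer:" tokens in the comment.
PROMPT_TEMPLATE = "Question: {question}\n\nAnswer: "


def build_prompt(question: str) -> str:
    """Wrap a raw question so the model is cued to produce an answer."""
    return PROMPT_TEMPLATE.format(question=question.strip())


def split_completion(full_text: str) -> str:
    """Return only the text that follows the last "Answer:" marker."""
    marker = "Answer:"
    _, found, tail = full_text.rpartition(marker)
    # If the marker is missing, treat the whole text as the completion.
    return tail.strip() if found else full_text.strip()


prompt = build_prompt("What is the color of the sky on Mars?")
```

If the model has not learned that the text after “Answer:” should be an answer, it may simply keep writing in the asker's voice, which is what the llama-7b-se-rl-peft output above looks like.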

[0]: https://open-assistant.io/chat


Just tried the same out of curiosity: "The Mars sky would be pink if you were to look up and observe it in person. This is due to the presence of finely dispersed iron oxide particles in its atmosphere, which scatter blue light from the sun and cause the sky to appear reddish-pink in color. This phenomenon is known as "Rayleigh scattering." If seen through a telescope on Earth however, the Martian sky may appear dark or even black depending on atmospheric conditions and time of day/year. It can sometimes be observed with a bluish tint during sunrise or sunset when more sunlight reaches the planet's surface and illuminates the dust particles in the air. During certain times of year (such as spring and summer), a thin layer of water ice clouds can also form at higher altitudes, causing the sky to become slightly whiter than usual."

Model: OA_SFT_Llama_30B