Many people have been struggling to reproduce the benchmark numbers included in the original llama paper.