| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by snyhlxde 777 days ago

Hi we are CLLM authors and thanks for sharing your experience and insights! I can see this drawing skill refining process echoes with the training process in CLLM, the only thing is at this point stressor in CLLM training is not getting progressively demanding.

For example, while drawing, you can set very specific time limit on how long you are allowed to draw in each trial and make the time progressively shorter. In CLLM, maybe we can make this the learning process more and more difficult by mapping more and more distant states in Jacobi trajectory to its final state.

We are using the term "consistency" because we draw parallelism between consistency LLM and the consistency model in diffusion image generation where the training processes are analogous.

2 comments

boroboro4 777 days ago

Do you use same dataset to train / eval the model? Was the model used for example trained on GSM8K dataset for example?

link

snyhlxde 777 days ago

Yes, we consider both domain-specific applications (spider for text2SQL, gsm8k for math, codesearchnet for python) as well as open-domain conversational applications (ShareGPT). We use test set from each application to evaluate CLLMs’ performance in our paper.

On the other hand, technically CLLM works on any kind of queries. But the speedup might vary. Feel free to try out our codebase for your use cases!

link

Quarrel 777 days ago

Is it just me, or does this read like it was written by an LLM ... ?!

link

jasonjmcghee 777 days ago

It's just much more formal than people generally speak on HN.

link

snyhlxde 777 days ago

lol I take that as a compliment. Good try but sadly no LLM in this writing :)

link