How can one use it for personal use? As I understand it, it will not fit into the memory of a single GPU available to the average person. Does someone need to distill the model first?