I don't think it uses any fancy modern CPU features, but this is probably the reigning champion of the 256-byte demo; see its write-up [0].
[0] http://www.sizecoding.org/wiki/Memories