Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps