torch_gc/empty cache after generation
Added torch_gc(), which calls both torch.cuda.empty_cache() and torch.cuda.ipc_collect(). It is called before and after generation.
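
A minimal sketch of what such a helper might look like, assuming PyTorch is installed. The `torch.cuda.is_available()` guard and the `generate_with_gc` wrapper (including its name and signature) are illustrative assumptions, not taken from the actual commit; only the two `torch.cuda` calls are named in the commit message.

```python
import torch


def torch_gc():
    """Release cached CUDA memory held by PyTorch's allocator."""
    # Assumption: skip the calls on machines without CUDA so the helper is a no-op there.
    if torch.cuda.is_available():
        # Return unused cached blocks from the caching allocator to the driver.
        torch.cuda.empty_cache()
        # Release GPU memory held for CUDA IPC handles that are no longer in use.
        torch.cuda.ipc_collect()


def generate_with_gc(generate_fn, *args, **kwargs):
    """Hypothetical wrapper: run torch_gc() before and after a generation call."""
    torch_gc()
    try:
        return generate_fn(*args, **kwargs)
    finally:
        # Runs even if generation raises, so cached memory is still released.
        torch_gc()
```

Note that `torch.cuda.empty_cache()` only frees memory that PyTorch has cached but is not using; tensors that are still referenced stay allocated.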