ML Practice

Data parallelism in pytorch

Optimizing memory operations for CUDA