AVX512 is a bit of a disappointment

AVX-512: A Performance Disappointment for Deep Learning on Skylake-X October 2020 I have been hesitant to publish this analysis, as the findings were rather underwhelming. However, the technicalnuance makes it a story worth sharing. Recently, I acquired Skylake-X processors with AVX-512 support and optimizedthe DeepTrainer engine to take advantage of these wider vector instructions. Given […]

DeepTrainer roadmap changes

More than a year passed since the last productive post, since then I have got busy with a daytime job. In the meantime I have been thinking a lot where I could define the main selling point of this project if I wanted to make it to generate income for me. I stopped making the […]

Dynamic Graph Architecture to Enable Composite Networks

Dynamic Graph Architecture: Enabling Composite Networks DeepTrainer Technical Update The evolution of neural network architectures moves rapidly. Recently, the rise of Deep Residual Learning (ResNet) caught my attention. The core concept—introducing “skip connections” where the input of a layer is added to the output of a later layer—presented a challenge for DeepTrainer’s existing architecture. My […]

New XNNS file format to save and load neural networks

I have been reading about the ONNX file format recently that has been created internally by Google, but before I delve myself into protocol buffers I still needed an easily readable (read: debuggable) file format to exchange neural network states. I know that eventually I will have to end up supporting ONNX so I did […]