AVX512 is a bit of a disappointment

AVX-512: A Performance Disappointment for Deep Learning on Skylake-X October 2020 I have been hesitant to publish this analysis, as the findings were rather underwhelming. However, the technical nuance makes it a story worth sharing. Recently, I acquired Skylake-X processors with AVX-512 support and optimized the DeepTrainer engine to take advantage of these wider vector instructions. Given […]

Dynamic Graph Architecture to Enable Composite Networks

Dynamic Graph Architecture: Enabling Composite Networks DeepTrainer Technical Update The evolution of neural network architectures moves rapidly. Recently, the rise of Deep Residual Learning (ResNet) caught my attention. The core concept—introducing “skip connections” where the input of a layer is added to the output of a later layer—presented a challenge for DeepTrainer’s existing architecture. My […]

New XNNS file format to save and load neural networks

I have recently been reading about the ONNX file format, which was created by Facebook and Microsoft, but before I delve into protocol buffers I still needed an easily readable (read: debuggable) file format to exchange neural network states. I know that eventually I will have to end up supporting ONNX so I did […]

Why DeepTrainer?

Recently I’ve been reading quite a lot about activation functions and Neural Networks in general and I think I found a good answer to a question that has been bugging me (and others who know what I am working on) ever since I started working on my own deep learning framework. I’ve had conversations with […]

Artificial Intelligence Fight V. – Playing with activation functions, introducing CUDA C/C++, and thoughts about SGI, Nvidia and Intel.

Positive results My marketing department, which is just around in the bedroom (where dreams come t̶r̶u̶e̶ and go), has been bugging me to continue the AI Fight sequel, so here it is. When I reach #XVI, someone please warn me diplomatically to stop, otherwise it will gain consciousness and start its own Netflix pilot. There is […]