techdays25

A collection of Jupyter Notebooks and Code Snippets for the Atruvia Tech Days 2025.

Breaking AI Bottlenecks: The Art of Neural Network Inference Optimization

Deploying deep learning models in real-world AI applications often runs into significant challenges, including high latency, limited computational resources, and scalability demands. Whether models are deployed on edge/mobile devices, large-scale cloud infrastructure, or on-premises systems, achieving optimal inference performance is crucial for applications like image recognition, document understanding, and speech processing. Without thorough optimization, even the most sophisticated models risk falling short in practical usability because of inefficient computation and resource management.

In this hands-on session, we will explore advanced techniques for optimizing deep learning models for efficient inference, focusing on practical methods that achieve real-world performance gains. By maximizing performance while minimizing trade-offs in accuracy and resource usage, we aim to bridge the gap between theoretical advancements and production-ready AI systems. We will cover quantization for faster, memory-efficient models, ONNX for cross-platform deployment, and CUDA optimizations such as memory management and asynchronous operations.

Participants will optimize a pre-trained neural network step by step using Google Colab. With just a laptop, a Google account, and basic Python programming knowledge, you will be ready to follow along, or you can simply observe the live demonstration if you prefer.
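To give a feel for the quantization topic mentioned above, here is a minimal sketch of symmetric per-tensor int8 quantization in plain Python. It is not taken from the session notebooks: the function names `quantize_int8` and `dequantize` and the sample `weights` are hypothetical, chosen only for illustration. In practice one would rely on the quantization tooling of the chosen framework rather than hand-written code like this.

```python
def quantize_int8(values):
    """Map floats to int8 codes with one shared scale (symmetric scheme)."""
    max_abs = max(abs(v) for v in values)
    # Guard against an all-zero input to avoid dividing by zero.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer code and clamp to the int8 range used here.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float values from the integer codes."""
    return [c * scale for c in codes]


# Hypothetical weights, standing in for one layer of a real model.
weights = [0.5, -1.27, 0.03, 1.0, -0.75]

codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# The worst-case rounding error is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

Each value is stored as an 8-bit integer plus one shared floating-point scale, which is where the memory savings come from; the price is a rounding error of at most half a quantization step per value.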

Development

Install pre-commit hooks:

pre-commit install
pre-commit install --hook-type commit-msg


MarkusThill/techdays25
