backend & ml infrastructure · alexandria, egypt
Founding engineer at Nover. I run GPU inference infrastructure on AWS, build distributed systems in Go and Python, and occasionally trace deep learning all the way back to statistical mechanics.
fig. 0 — a live 2D Ising model (Metropolis–Hastings), taken from my Physics-AI Bridge research. Drag the temperature through the critical point Tc ≈ 2.27 and watch order emerge from noise. This isn't a looping gif — it's computing in your browser right now.
§0 — brief
I'm a final-year Electrical Engineering (Electronics & Communications) student at Alexandria University who didn't wait for graduation to build real things.
Right now I'm the founding engineer at Nover, an AI image-generation startup, where I built the production backend — fine-tuning diffusion models on H100s, serving them from a GPU fleet on AWS, and owning everything in between. We're raising pre-seed.
Before that, I spent a year and a half as a software engineer at a US neuromodulation research company, building a real-time EEG "digital twin" pipeline — streaming brainwave data from OpenBCI hardware through Hilbert transforms into coupled-oscillator models — and authored a research paper on Kuramoto-based EEG synchronization.
On the side, I led Mind Cloud, a 70+ member robotics organization, from a 9th-place finish at the European Rover Challenge to 2nd and 3rd place at UGVC — migrating the whole stack to ROS 2 along the way.
I like building things that are hard. I like shipping them even more.
§1 — current work
Nover generates images for creative teams. Every generation request flows through infrastructure I own end to end: an API gateway, a distributed job queue, an autoscaled GPU worker fleet, and global content delivery — all on AWS. The specifics stay under wraps until we launch; the responsibility doesn't.
Beyond serving, I run full fine-tunes of our proprietary diffusion models on rented H100s, and own the release infrastructure — including the pipeline that delivered our waitlist launch announcement, with throttling and resume capability built in.
FROM THE TRENCHES
Our heaviest generation workloads kept hitting a memory-bound failure mode that looked like a GPU problem. It wasn't — the bug lived in the request path. Fixed the path, then upgraded the fleet anyway. The full story comes after launch.
§2 — case study
A fault-tolerant task queue built on the Reliable Queue Pattern: at-least-once delivery
through atomic Redis LMOVE operations, multi-tier priority queues, delayed scheduling via
sorted sets, batch ingestion, and dead-letter queues for poison messages.
The interesting part is failure. A lease-based reaper watches for workers that died mid-task and atomically reclaims their work, and the whole system is instrumented with native Prometheus metrics. I validated it the only way that counts — automated chaos tests that kill workers at random while the queue is under load.
WHAT BROKE
Chaos testing exposed a race between a slow worker renewing its lease and the reaper reclaiming the same task — two consumers, one job. The fix was making check-and-extend a single atomic Lua script on Redis, closing the window entirely.
§3 — case study
A streaming pipeline that watches BTC trade on two exchanges at once and spots the moments they disagree. WebSocket feeds from Coinbase and Binance land in Kafka; Flink tumbling windows align the two streams in time and compute the spread; an alerter pushes live notifications to Discord while a Streamlit dashboard charts spreads and alert history.
The system is decomposed into four services — producer, stream processor, alerter, dashboard — each independently deployable and orchestrated with Docker Compose, so any piece can fail or restart without taking down the pipeline.
DESIGN NOTE
Two exchanges tick at different rates, so comparing "latest price vs latest price" lies to you. Tumbling windows force both streams onto the same clock before the spread is computed — the alignment, not the math, is the hard part.
§4 — case study
A high-performance packet sniffer in modern C++. NetProbe captures live traffic through Npcap, performs deep inspection of TCP/IP headers, and extracts TLS SNI — so you can see which hosts a machine is talking to even when the payload is encrypted — rendering everything in a real-time Dear ImGui interface.
Packets arrive in bursts far faster than any UI can draw. The architecture is a multi-threaded
producer–consumer pipeline using std::jthread: a capture thread feeds a
lock-guarded buffer, parser workers drain it, and the render thread never blocks on the network.
DESIGN NOTE
The rule that shaped everything: the thread that draws pixels must never wait on the thread that reads the wire. Decoupling them is the difference between a tool engineers trust during a traffic spike and one that freezes exactly when it matters.
§5 — research
Deep learning didn't appear from nowhere — a surprising amount of it is statistical mechanics wearing a different hat. This research project traces that lineage explicitly, with working code at every step: from the 2D Ising model through Hopfield networks toward Neural Network Gaussian Processes.
Phase 1 is a fully vectorized Ising simulation — the one running at the top of this page — that reproduces the critical temperature to within 0.05% of Onsager's exact solution. Phase 2 maps Ising dynamics onto Hopfield networks as learnable energy models, recovering 68% strict pattern recall at 25% corruption, with a full LaTeX write-up.
WHY IT MATTERS
Energy landscapes, temperature, phase transitions — these aren't metaphors in machine learning, they're the actual math. Understanding a model as a physical system is the difference between tuning hyperparameters by superstition and knowing why they work.
§6 — also built
§7 — toolbox
python · go · c++ · java
typescript · sql · c
fastapi · spring boot · redis
kafka · flink · docker · prometheus
aws: ecs · s3 · efs · cloudfront ·
ses
pytorch · comfyui · lora fine-tuning
diffusion inference · modal/h100
hugging face · dsp
§8 — contact
Building something interesting? I'd love to hear about it.