blog.johlem.net

Building an Air-Gapped Proxmox Lab for OSCP Prep and Detection Engineering

Disclaimer: this is a personal lab on my own hardware. Nothing in this post is drawn from any client engagement, and none of the detections or offensive techniques below have ever been deployed outside the box on my desk.

I wanted one environment where I could throw real offensive tooling at a real blue-team stack, fail loudly, and leave no evidence outside a single physical box. This is how that came together.

Why air-gapped

Three reasons, in descending order of importance:

  1. Regulatory reality. I live inside a CSSF-supervised entity. C2 traffic leaving my home network is an incident even if the intent is innocent. It is far easier to explain “I physically unplugged the uplink” than it is to explain “the EDR flagged my home lab as a staging box”.
  2. Detection purity. When I'm evaluating a Suricata rule against a specific TTP, I need to know with certainty that the only traffic hitting the sensor is lab traffic. Background internet noise ruins honest measurement.
  3. Mental hygiene. Hard physical boundary between work-related tooling and curiosity-driven experimentation. The red-team VLAN has no DNS resolver that can see a real root server. That is on purpose.

Hardware

The entire thing runs on one MINISFORUM MS-A2.

At roughly €1,200 all-in, it outperforms a rack of 2018-era enterprise gear and idles around 35 W. A single box is a deliberate constraint: it forces me to keep the lab small enough that I can tear it down and rebuild it from Ansible in a weekend.

Peripherals:

Proxmox install

Single-node PVE 8.x on the ZFS mirror. Key tweaks after the default install:

# Disable the enterprise repo, enable no-subscription
sed -i 's/^/#/' /etc/apt/sources.list.d/pve-enterprise.list
echo "deb http://download.proxmox.com/debian/pve bookworm pve-no-subscription" \
  > /etc/apt/sources.list.d/pve-no-subscription.list

# Let KVM nest — needed for running Windows with a functional EDR inside a VM
echo "options kvm-amd nested=1" > /etc/modprobe.d/kvm-amd.conf

# Zero swappiness on the host; VMs should swap in their own disks if at all
echo "vm.swappiness=0" >> /etc/sysctl.conf

# Suricata and Zeek live on the host, not in a VM — they need to see tap
# traffic from the virtual switch directly
apt install -y suricata zeek
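
For the host sensor to actually see tap traffic, Suricata's capture has to be bound to the virtual switch. A minimal sketch of the relevant suricata.yaml stanza; the bridge name vmbr1 is an assumption on my part, not something stated above:

```yaml
# /etc/suricata/suricata.yaml (fragment): bind af-packet capture to the
# lab bridge so the host-side sensor sees inter-VLAN traffic.
# "vmbr1" is an assumed bridge name.
af-packet:
  - interface: vmbr1
    cluster-id: 99
    cluster-type: cluster_flow
    defrag: yes
```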

I run no cluster. A single-node “cluster of one” keeps the moving parts down. The day I need HA for this lab is the day I've lost the plot.

Network: 10 VLANs

The core of the design. Every VM lands on exactly one VLAN. Inter-VLAN traffic is explicit, logged, and matched by at least one Suricata rule.
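
On the Proxmox side, "every VM lands on exactly one VLAN" maps cleanly onto a single VLAN-aware bridge, with each virtual NIC carrying one tag. A sketch of the /etc/network/interfaces stanza; the physical NIC name and the VID range are illustrative, not from this build:

```
# /etc/network/interfaces (fragment): one VLAN-aware bridge carries all
# lab segments; each VM's virtual NIC gets exactly one VLAN tag.
# "enp2s0" and the 10-100 VID range are assumptions.
auto vmbr1
iface vmbr1 inet manual
    bridge-ports enp2s0
    bridge-stp off
    bridge-fd 0
    bridge-vlan-aware yes
    bridge-vids 10-100
```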

Rule of thumb: if a VM needs to reach the real internet, it does not belong in this lab. Every OS image is patched offline and seeded from a separate, internet-connected machine that never plugs into the lab switch.
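
To make the "matched by at least one Suricata rule" requirement concrete, here is a hedged sketch of the kind of rule involved: alert on SMB crossing from the red-team segment into the corp VLANs. The subnets, SID, and message are invented for illustration and would need to match the real addressing plan:

```
# Illustrative only: subnets and sid are assumptions, not the lab's real plan.
alert tcp 10.0.20.0/24 any -> [10.0.40.0/24,10.0.50.0/24] 445 \
  (msg:"LAB inter-VLAN SMB from red-team segment"; \
   classtype:policy-violation; sid:1000001; rev:1;)
```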

Blue team stack

Running on VLAN 70:

An example of a teaching-grade Sigma rule I use to check the pipeline end-to-end. It fires on any handle open to LSASS from a non-system process, which is deliberately noisy:

title: LSASS Handle Access From Non-System Process
status: experimental
logsource:
  product: windows
  service: sysmon
detection:
  selection:
    EventID: 10
    TargetImage|endswith: '\lsass.exe'
  filter_system:
    SourceImage|contains:
      - '\System32\csrss.exe'
      - '\System32\services.exe'
      - '\System32\wininit.exe'
  condition: selection and not filter_system
level: high

Noisy rules like this are useful for plumbing tests. Once a new log path is flowing, you silence them or replace them with something more targeted.
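
As a sketch of what "more targeted" can look like for the same event source: keying on the access masks most dump tools actually request. The GrantedAccess values below are the commonly abused read masks; this is illustrative and the exact values would need tuning against a local baseline, not a production rule:

```yaml
# Tighter variant (sketch, assumed thresholds): same Sysmon Event ID 10,
# but only on the access masks typical of credential dumpers.
detection:
  selection:
    EventID: 10
    TargetImage|endswith: '\lsass.exe'
    GrantedAccess|contains:
      - '0x1010'   # PROCESS_VM_READ | PROCESS_QUERY_LIMITED_INFORMATION
      - '0x1410'   # PROCESS_VM_READ | PROCESS_QUERY_INFORMATION
  condition: selection
```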

Red team stack

Running on VLAN 20:

The detection engineering loop

This is the whole reason the lab exists:

  1. Pick a technique. Example: T1003.001 — LSASS memory dumping.
  2. Run it from VLAN 20 against a host in VLAN 40 or 50.
  3. Watch Wazuh + Graylog. Did anything fire? How quickly?
  4. If nothing fired: write the Sigma rule, deploy it.
  5. Vary the method: Mimikatz → nanodump → ProcDump → comsvcs.dll → direct syscall invocation. Does the rule survive?
  6. If the rule stops firing: understand why, then improve it.
  7. Commit rule + notes to a private Git repo. Tag by MITRE technique.
  8. Tear down. Snapshot revert. Start over.
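
Steps 2 and 8 are mostly Proxmox plumbing, handled with qm on the host. A sketch of the snapshot discipline, assuming VM 401 is the VLAN-40 target; the VMID, snapshot name, and description are invented for illustration:

```shell
# On the PVE host: freeze a clean baseline before the technique runs,
# then revert once the logs are collected.
# VMID 401 and all names here are assumptions.
qm snapshot 401 clean-baseline --description "patched, Sysmon + Wazuh enrolled"
# ... run the TTP from VLAN 20, collect the Wazuh/Graylog evidence ...
qm rollback 401 clean-baseline
qm start 401   # a disk-only snapshot restores the VM stopped
```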

One technique per evening is a comfortable pace. Over six months that produces a few dozen rules I trust, and roughly the same number I had to walk away from because they couldn't survive basic variation.

Rough cost

All-in, it comes to less than the cumulative cost of a decent cert path over the same period, and it's considerably more useful once it's running.

What's next

Three things on the list:

  1. A second, identical MS-A2 for multi-site NIS2 resilience scenarios. One physical box is a single point of failure by design today.
  2. Publishing stable Sigma rules to a public repo as they harden. Noisy teaching rules stay private.
  3. A separate writeup of the Windows 11 + BitLocker + Sysmon baseline I use for every new corp-VLAN image. It deserves its own post.

If you're building something similar: the boring work — patching golden images, rotating AD passwords, keeping firewall rules coherent — takes more time than the flashy detection engineering does. Budget accordingly. The lab is a forcing function for habits, not a substitute for them.