pyGAEB: The Python Library That Unlocks GAEB Construction Data

pyGAEB: The Python Library That Unlocks GAEB Construction Data Parse, validate, classify, and write GAEB DA XML files — with optional LLM-powered item classification — in one open-source Python pac...

By · · 1 min read
pyGAEB: The Python Library That Unlocks GAEB Construction Data

Source: DEV Community

pyGAEB: The Python Library That Unlocks GAEB Construction Data Parse, validate, classify, and write GAEB DA XML files — with optional LLM-powered item classification — in one open-source Python package. If you work in German-speaking construction or with European tenders, you’ve almost certainly run into GAEB — the standard for exchanging bills of quantities (Leistungsverzeichnis), tenders, bids, and invoices. GAEB DA XML is the modern format: XML-based, versioned (2.0 through 3.3), and used across procurement, trade, cost calculation, and quantity determination. The catch? Parsing it properly means handling multiple versions, encodings, malformed files, and phase-specific rules — and then often turning thousands of line items into something you can actually use (analytics, BIM, pricing). pyGAEB is an MIT-licensed Python library that does exactly that: one API for all GAEB DA XML versions and exchange phases, a unified Pydantic domain model, optional LLM-based item classification (100+