LF AI & Data Foundation Launches DocLang Specification Working Group to Advance an Open Standard for AI-Native Documents

LF AI & Data Foundation Launches DocLang Specification Working Group to Advance an Open Standard for AI-Native Documents

LF AI & Data Foundation Launches DocLang Specification Working Group to Advance an Open Standard for AI-Native Documents

https://www.prnewswire.com/news-releases/lf-ai–data-foundation-launches-doclang-specification-working-group-to-advance-an-open-standard-for-ai-native-documents-302794922.html

Publish Date: 2026-06-09 09:00:00

Source Domain: www.prnewswire.com

New specification, supported by leading LF AI & Data member organizations IBM and Red Hat, as well as other organizations including ABBYY, complements the Docling open source project

SAN FRANCISCO, June 9, 2026 /PRNewswire/ — LF AI & Data Foundation, the premier organization supporting open source innovation in artificial intelligence and data under the Linux Foundation, today announced the formation of the DocLang Specification Working Group. This working group supports a new collaborative standards development initiative to develop DocLang, an open, universal, AI-native document format designed to improve how enterprises prepare, exchange, and govern document data for AI systems.

Founded by LF AI & Data premier members IBM, NVIDIA, and Red Hat, as well as contributors ABBYY and HumanSignal, the DocLang Working Group will operate under Joint Development Foundation’s vendor-neutral, open governance model to develop and maintain a specification that supports more reliable, interoperable document processing across AI and agentic workflows.

“Documents remain one of the most important sources of enterprise knowledge, but most were never designed for AI-driven workflows,” said Mark Collier, general manager of AI & Infrastructure at the Linux Foundation and executive director of LF AI & Data. “With the launch of the DocLang Working Group, we are bringing the open source community together to develop a vendor-neutral, interoperable standard that helps organizations prepare document data for AI more reliably, transparently, and at scale. Combined with projects like Docling, this effort can help create a more open foundation for document understanding across the AI ecosystem.”

“DocLang is the culmination of years of research into how documents can be represented more efficiently and more faithfully for AI systems,” said Peter Staar, Principal Research Scientist and Manager at IBM Software. “Our work…

Source