
Semi-structured financial text – such as earnings call transcripts, 10‑K and 10‑Q filings, MD&A sections, loan term sheets, and broker research PDFs – poses a persistent challenge for automated data extraction. These documents combine predictable patterns (dates, currency amounts, section headings) with highly variable, nuanced natural language (risk disclosures, forward‑looking statements, covenant descriptions).
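To make the contrast concrete, here is a minimal sketch in Python, assuming a made-up snippet of filing text. The sample string, the three regular expressions, and the `extract_patterns` helper are all illustrative assumptions, not taken from any real filing or library. It shows that the predictable elements (a date, a currency amount, a section heading) can be captured with simple patterns, while the forward-looking risk sentence yields nothing.

```python
import re

# Hypothetical snippet invented for illustration; not from a real filing.
SAMPLE = (
    "Item 7. Management's Discussion and Analysis\n"
    "On March 31, 2023, the Company drew $12.5 million under the revolver. "
    "Management believes liquidity may be adversely affected if market "
    "conditions deteriorate."
)

# Long-form US dates such as "March 31, 2023".
DATE_RE = re.compile(
    r"\b(?:January|February|March|April|May|June|July|August|September|"
    r"October|November|December) \d{1,2}, \d{4}\b"
)

# Dollar amounts with optional thousands separators, decimals, and scale word.
AMOUNT_RE = re.compile(
    r"\$\d+(?:,\d{3})*(?:\.\d+)?(?: (?:thousand|million|billion))?"
)

# Headings of the form "Item 7." or "Item 1A." at the start of a line.
HEADING_RE = re.compile(r"^Item \d+[A-Z]?\..*$", re.MULTILINE)


def extract_patterns(text):
    """Return the predictable, pattern-like elements found in the text."""
    return {
        "dates": DATE_RE.findall(text),
        "amounts": AMOUNT_RE.findall(text),
        "headings": HEADING_RE.findall(text),
    }


if __name__ == "__main__":
    for kind, matches in extract_patterns(SAMPLE).items():
        print(kind, matches)
```

The patterns recover the date, the amount, and the heading, but they say nothing about the sentence beginning "Management believes", which is exactly the kind of nuanced language that pattern matching alone does not handle.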