FEDS Paper: Validating Large Language Model Annotations

Anne Lundgaard Hansen

This paper proposes a validation framework for LLM-generated measurements when reliable benchmarks are unavailable. Validity is established by testing whether an LLM can reconstruct passages from annotated labels while maintaining semantic consistency with the original text. The framework avoids circular reasoning by establishing testable prerequisite properties that must be met for a validation to be considered successful.

Supreme Court’s tariff decision still leaves a ‘mess’ for companies trying to grab refunds

Containers are stacked up in a cargo terminal in Frankfurt, Germany. AP Photo/Michael Probst

U.S. companies stung by President Donald Trump’s emergency tariffs had hoped for relief when the U.S. Supreme Court ruled in their favor in February 2026. But settling on a remedy – namely, rebate checks from the government – may be an even bigger headache.
