Skip to content

New Cdata element causing missing data #322

@Rohaansandhu

Description

@Rohaansandhu

When running usc-run bills for the 119th congress (this most recent one in 2025), I noticed that bill summary text was always empty. When checking the xml, I found the summaries elements however it appears as though the summary text element is now wrapped under a cdata element.

Sadly even though this change exists, the schema version remained unchanged. I made a PR for this specific issue.

If you want to take a look at the xml data I'm talking about, click on the examples I gave and look for the summaries element under bill.

Here are examples of the cdata element in congress 118 and 119:
https://www.govinfo.gov/bulkdata/BILLSTATUS/119/hr/BILLSTATUS-119hr1.xml
https://www.govinfo.gov/bulkdata/BILLSTATUS/118/hr/BILLSTATUS-118hr1.xml

Here's what the old ones looked like (without cdata):
https://www.govinfo.gov/bulkdata/BILLSTATUS/117/hr/BILLSTATUS-117hr1.xml
https://www.govinfo.gov/bulkdata/BILLSTATUS/116/hr/BILLSTATUS-116hr1.xml

I believe this may be an issue for other text elements as well, so this might be worth looking into.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions