Updated load_dataset to be resistant to bad columns#1511

Open
SilasMarvin wants to merge 1 commit into master from silas-load-in-arbitrary-dataset

@SilasMarvin
Contributor

This PR changes the load_dataset function so that it no longer crashes when values in some columns of the dataset fail to deserialize.

I found that when loading the hotpotqa/hotpot_qa dataset, some of the values in the answer column were null, which crashed the function. Now we emit a warning and skip that row.
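
For readers who want the gist of the skip-and-warn behavior, here is a minimal Python sketch. It is not the code in this PR: `load_valid_rows`, the `required` parameter, and the row-as-dict shape are all hypothetical names chosen for illustration, and a null value stands in for any value that fails to deserialize.

```python
from __future__ import annotations

import warnings
from typing import Any, Iterable, Iterator


def load_valid_rows(
    rows: Iterable[dict[str, Any]], required: list[str]
) -> Iterator[dict[str, Any]]:
    """Yield only rows whose required columns are present and non-null.

    Hypothetical helper: a bad row triggers a warning and is skipped
    instead of aborting the whole load.
    """
    for index, row in enumerate(rows):
        # Collect every required column that is missing or null in this row.
        bad = [column for column in required if row.get(column) is None]
        if bad:
            warnings.warn(f"skipping row {index}: null or missing column(s) {bad}")
            continue
        yield row


if __name__ == "__main__":
    sample = [
        {"question": "q1", "answer": "a1"},
        {"question": "q2", "answer": None},  # skipped with a warning
    ]
    for row in load_valid_rows(sample, ["question", "answer"]):
        print(row)
```

The trade-off in this sketch is that bad rows are dropped silently from the result apart from the warning, so a caller that needs an exact row count would have to track skipped rows separately.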

@SilasMarvin SilasMarvin requested a review from montanalow June 6, 2024 22:33