Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reading CSV with byte order mark bleeds into first Column Name #687

Open
janseeger opened this issue May 5, 2024 · 4 comments
Open

Reading CSV with byte order mark bleeds into first Column Name #687

janseeger opened this issue May 5, 2024 · 4 comments
Labels
bug Something isn't working
Milestone

Comments

@janseeger
Copy link

If you try to read a CSV file that begins with a BOM, the first column name will be all weird.

Screenshot 2024-05-05 at 20 20 56
@Jolanrensen
Copy link
Collaborator

Do you have a sample CSV so we can make some tests too?

@janseeger
Copy link
Author

Unfortunately none that I can easily share. I'll try to create a small reproducer this evening.

@janseeger
Copy link
Author

Here's the CSV File. I noticed that the Column Names are correct when accessed with columnNames() but the generated code does not strip the BOM character.

Screenshot 2024-05-07 at 12 37 21

@Jolanrensen Jolanrensen added the bug Something isn't working label May 8, 2024
@Jolanrensen Jolanrensen added this to the 0.14.0 milestone May 8, 2024
@Jolanrensen
Copy link
Collaborator

Jolanrensen commented May 8, 2024

Can reproduce, thanks :) Looks like the column name is read incorrectly entirely, since getting by name (without zero width non breaking space character) also breaks.

image

The BOM character should be skipped if a file starts with it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants