Skip to content

[Python] CSV reader returns different values in 23.0.0 #49003

@rhshadrach

Description

@rhshadrach

Describe the bug, including details regarding any error messages, version, and platform.

This is currently causing failures on the pandas CI.

import pyarrow.csv

table = pyarrow.csv.read_csv(io.BytesIO(f"data\n10E-617".encode()))
print(table)
# pyarrow.Table  <-- pyarrow 22.0.0
# data: double
# ----
# data: [[0]]

# pyarrow.Table  <-- pyarrow 23.0.0
# data: string
# ----
# data: [["10E-617"]]

It's not clear to me if this was an intentional change. The only thing that looked possibly related from the release notes was

#47925

but I haven't bisected to find if this was the cause.

Component(s)

Python

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions