-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TableTools.readCsv validation of column names happens after all the loading work has been done. #1561
Comments
The particular case I hit this is interesting. By default, Pandas's
And DHC trips over the empty column name with the validation exception. Interestingly enough, DHE bard loaded the same file no problem. Example:
DHE makes up a column name of |
#1469 is relevant to empty column names |
I'm happy to add empty column name support for our read csv - but I'm a bit curious why pandas would choose to write out an empty column name by default. Is there a way we can dataframe.to_csv w/ a named index column? |
https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.to_csv.html#pandas.DataFrame.to_csv:
|
The particular file I was working with was obtained via
So the index for the dataframe that was output to CSV was what was created by the conversion from pyarrow table to dataframe. Which I think is likely to just be the default index that is created by dataframe contructors when one is not provided, namely, a RangeIndex(0..rows-1) with no name. If I am right in that guess, it means that a default initialized dataframe, when no index is provided explicitly during construction, results in a dataframe with an index with no name. Which, when output to CSV, creates a corresponding column with empty name. Which one could argue is a pretty sad outcome, but maybe this is the world we live in. |
@devinrsmith and @jcferretti to watch it. |
Fixed in #1629 |
# [0.59.0](deephaven/web-client-ui@v0.58.0...v0.59.0) (2024-01-17) ### Bug Fixes * GoToRow timestamp fails when selected row is out of view ([#1717](deephaven/web-client-ui#1717)) ([9ddc973](deephaven/web-client-ui@9ddc973)), closes [#1561](deephaven/web-client-ui#1561) * Interface for IrisGridTableModelTemplate.backgroundColorForCell ([#1699](deephaven/web-client-ui#1699)) ([73e1837](deephaven/web-client-ui@73e1837)), closes [#1697](deephaven/web-client-ui#1697) * Moved logos so they show in production build ([#1713](deephaven/web-client-ui#1713)) ([a3bea73](deephaven/web-client-ui@a3bea73)), closes [#1712](deephaven/web-client-ui#1712) * re-colorize command codeblocks when theme changes ([#1731](deephaven/web-client-ui#1731)) ([b1e42f5](deephaven/web-client-ui@b1e42f5)) * TimeInput not triggering onChange on incomplete values ([#1711](deephaven/web-client-ui#1711)) ([6894d96](deephaven/web-client-ui@6894d96)), closes [#1710](deephaven/web-client-ui#1710) ### Features * Action button tooltips ([#1706](deephaven/web-client-ui#1706)) ([bff6bf9](deephaven/web-client-ui@bff6bf9)), closes [#1705](deephaven/web-client-ui#1705) * Add support for useDeferredApi ([#1725](deephaven/web-client-ui#1725)) ([51ebe1b](deephaven/web-client-ui@51ebe1b)) * Improved preload variable handling ([#1723](deephaven/web-client-ui#1723)) ([ed41c42](deephaven/web-client-ui@ed41c42)), closes [#1695](deephaven/web-client-ui#1695) [#1679](deephaven/web-client-ui#1679) * NavTabList component ([#1698](deephaven/web-client-ui#1698)) ([96641fb](deephaven/web-client-ui@96641fb)) * Reject promise immediately if var not found ([#1718](deephaven/web-client-ui#1718)) ([43d40bd](deephaven/web-client-ui@43d40bd)), closes [#1701](deephaven/web-client-ui#1701) * theming tweaks ([#1727](deephaven/web-client-ui#1727)) ([f919a7e](deephaven/web-client-ui@f919a7e)) ### BREAKING CHANGES * - Subclasses of IrisGridTableModelTemplate or it's subclasses that use backgroundColorForCell may need to update their signature to accept the theme if they are calling the superclass Co-authored-by: deephaven-internal <[email protected]>
For a huge CSV file that means you may have been waiting for several minutes for the data to load to only then blow up because a column has a name that doesn't validate.
We should do column name validation earlier, before the bulk of the load work.
The text was updated successfully, but these errors were encountered: