-
-
Notifications
You must be signed in to change notification settings - Fork 18.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Excel Document Passing Kwargs to Engine #25723
Comments
cc @WillAyd do you recall when this changed? @Hugovdberg does passing that to |
@TomAugspurger This appears to have changed in version 0.24, when |
Thanks for the clarification. So using In that case, is opening the file yourself with It looks somewhat difficult to pass kwargs through to the opening of the file. |
I would think that the current implementation could pass the kwargs straight to the reader implementation, which could then handle the arguments as necessary. In this case I think it would suffice to add |
Not at a computer to check but I think we recently disabled this because it was causing confusion.
Are you sure on_demand was even get passed through? I didn’t think ExcelFile was passing along kwargs to open_workbook but was silently discarding.
You could alternately instantiate an xlrd Book with that argument and pass to ExcelFile.
…Sent from my iPhone
On Mar 14, 2019, at 6:48 AM, Hugo van den Berg ***@***.***> wrote:
I would think that the current implementation could pass the kwargs straight to the reader implementation, which could then handle the arguments as necessary. In this case I think it would suffice to add **kwds on lines 380, 422, 424, 642, and 653. Future readers should then also accept a **kwds argument, and handle those as necessary (possibly throwing an error that the engine doesn't accept additional arguments).
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Right so this was intentionally changed in #24423. The keywords before were silently ignored so this actually had no effect - the error now is intentional and being explicit about this. As mentioned can you try something like: book = xlrd.open_workbook('your_book.xlsx', on_demand=True)
with pd.ExcelFile(book) as the_file:
... That should actually do something with the keyword |
Thanks for verifying @WillAyd. I agree that if you need control over how the file is opened, then opening it yourself and supplying the opened object to ExcelFile is the best option. @Hugovdberg would you be interested in submitting a PR explaining that at http://pandas-docs.github.io/pandas-docs-travis/user_guide/io.html?highlight=excelfile? |
**kwds
Can I work on this? |
In master branch there is no excel.py so where to make changes? |
Everything's Normal! Earlier (before pandas version 0.24) you could pass any number of parameters into the ExcelFile() function without any error as its function signature is (io,**kwds), which allows any number of parameters to be passed. However, with the new release of pandas they have changed the function signature to (io,engine=None), which allows only two arguments passed, which justifies the error. |
@codehard123 the purpose of this issue is to add documentation around how to pass keyword arguments to a particular engine. If interested you could use the information above and add to the section linked by @TomAugspurger |
Code Sample, a copy-pastable example if possible
Problem description
This code used to work, since extra keyword arguments were passed to
xlrd.open_workbook
. Aspd.ExcelFile
is now decoupled fromxlrd
this behaviour was removed. It now returns an error:In a way this is related to #25523, but this is a more general issue, and might or might not solve the problem in the other issue.
Expected Output
No error, and the keyword arguments respected. I understand that when other engines are implemented the
on_demand
argument might not make sense, but they might take similar arguments not explicitly made available in the interface ofpandas.ExcelFile
.Output of
pd.show_versions()
INSTALLED VERSIONS
commit: None
python: 3.7.2.final.0
python-bits: 64
OS: Windows
OS-release: 10
machine: AMD64
processor: Intel64 Family 6 Model 42 Stepping 7, GenuineIntel
byteorder: little
LC_ALL: None
LANG: None
LOCALE: None.None
pandas: 0.24.1
pytest: 4.2.0
pip: 19.0.1
setuptools: 40.7.3
Cython: 0.29.5
numpy: 1.15.4
scipy: 1.2.0
pyarrow: None
xarray: None
IPython: 7.2.0
sphinx: 1.8.4
patsy: 0.5.1
dateutil: 2.7.5
pytz: 2018.9
blosc: None
bottleneck: 1.2.1
tables: 3.4.4
numexpr: 2.6.9
feather: None
matplotlib: 3.0.2
openpyxl: 2.5.14
xlrd: 1.2.0
xlwt: 1.3.0
xlsxwriter: 1.1.2
lxml.etree: 4.3.0
bs4: 4.7.1
html5lib: 1.0.1
sqlalchemy: 1.2.17
pymysql: None
psycopg2: 2.7.6.1 (dt dec pq3 ext lo64)
jinja2: 2.10
s3fs: None
fastparquet: None
pandas_gbq: None
pandas_datareader: None
gcsfs: None
The text was updated successfully, but these errors were encountered: