-
Notifications
You must be signed in to change notification settings - Fork 5.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HiveMetastore outputFormat should not be accessed from a null StorageFormat #6972
Comments
Same issue on EMR. External hive table to S3 or DynamoDB works fine. But when trying to access the DynamoDB table, it throws the same error: |
Any update on this issue? We ran into it with S3-backup tables on Hive on version 0.173. |
I have same error on emr presto 0.166 Query 20170425_112726_00155_nfp6g failed: outputFormat should not be accessed from a null StorageFormat but next queries is correct work |
As of 0.198, I am also experiencing this error pretty regularly against S3 backed Parquet tables. Most tables have been created through Hive, though we also have tables that have been created through Presto. Offending tables have the following formats:
We have BI tools and data exploration tools that query |
This issue also shows up for Hive external table created on AWS EMR using AWS ElasticSearch as source and again, this format uses a storage handler as well. Below table can be queried using hive on AWS EMR but cannot be queried using presto on AWS EMR and similarly on AWS Athena (which seems to be using presto as well) Hive DDL :CREATE EXTERNAL TABLE prod_elastic_search.insights ( Error Details:presto> select * from prod_elastic_search.insights limit 10; |
We have tables using Hive StorageHandler and living in same metastore with regular and presto-queryable tables. External tools like JetBrain's Data Grip and BI tools, which use information_schema and system.jdbc.columns for displaying tables and columns regularly fail because of the StorageHandler tables, resulting different problems in the tools. Presto should handle this exception and ignore unqueryable tables. |
here is a simple makeshift : public String getOutputFormat() |
We should backport the fix from: trinodb/trino#568 |
I've opened a PR to backport the fix here: #12973 |
In lower version of Presto, sometimes the old query will throw "outputFormat should not be accessed from a null StorageFormat" error (see prestodb/presto#6972).
In lower version of Presto, sometimes the old query will throw "outputFormat should not be accessed from a null StorageFormat" error (see prestodb/presto#6972).
In lower version of Presto, sometimes the old query will throw "outputFormat should not be accessed from a null StorageFormat" error (see prestodb/presto/issues/6972). Sometimes there can be many many tables, it would be nice if we filter only for certain sources. We might also add access control based on the table filter.
And load schemas one by one. This should improve performance for large Presto instances where a single schema may contain thousands of tables. Plus, in lower version of Presto, sometimes the old query will throw "outputFormat should not be accessed from a null StorageFormat" error (see prestodb/presto#6972). This change allows us to skip this error and still return valid results.
And load schemas one by one. This should improve performance for large Presto instances where a single schema may contain thousands of tables. Plus, in lower version of Presto, sometimes the old query will throw "outputFormat should not be accessed from a null StorageFormat" error (see prestodb/presto#6972). This change allows us to skip this error and still return valid results.
And load schemas one by one. This should improve performance for large Presto instances where a single schema may contain thousands of tables. Plus, in lower version of Presto, sometimes the old query will throw "outputFormat should not be accessed from a null StorageFormat" error (see prestodb/presto#6972). This change allows us to skip this error and still return valid results.
This issue has been automatically marked as stale because it has not had any activity in the last 2 years. If you feel that this issue is important, just comment and the stale tag will be removed; otherwise it will be closed in 7 days. This is an attempt to ensure that our open issues remain valuable and relevant so that we can keep track of what needs to be done and prioritize the right things. |
With this commit , starting Presto 0.152, we can't query System Connector's JDBC/HiveMetastore if Hive tables were created from StorageHandlers where INPUT/OUTPUT Formats may not exists.
Repo:
Ran on Presto 0.152.3/emr-5.2.0 (Hive metastore can be external/internal).
The text was updated successfully, but these errors were encountered: