-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data Visualizer fails to import data without a timestamp #63526
Comments
Pinging @elastic/ml-ui (:ml) |
Only highly structured formats like CSV and NDJSON are accepted without timestamps. The reason is that for semi-structured log files the definition of the first line of each message is the line containing the identified timestamp, so without a timestamp there’s no way to split the file into messages. We should probably spell this out more clearly in the docs. Currently it is buried away in https://www.elastic.co/guide/en/elasticsearch/reference/current/ml-find-file-structure.html in the sentence:
|
I think we could make it possible to support import of semi-structured log files without timestamps, by implementing #38868 and elastic/elasticsearch#55219. |
This feature was very useful indeed and was promoted in many of the elasticsearch/kibana tutos or videos. It would be good to have it back as now I am stuck with such a basic stuff. Is there any workaround ? |
@cspielmann are you complaining that the entire feature has disappeared, or specifically that it doesn't work for semi-structured log files without timestamps? I believe the whole feature was accidentally made inaccessible on a basic license for one minor release and then fixed in the following patch release. There will be a separate issue for that somewhere if that's the problem you've got. Only highly structured formats like CSV and NDJSON are accepted without timestamps. That has always been the case. We could do an enhancement for semi-structured log files without timestamps, but that has never been demonstrated in a video as it has never worked. So please be more specific about exactly what doesn't work for you. |
Hello,
I am sorry I didn't want to be mean.I really enjoy using your product and
like your support.
The fact is that today I am not able to reproduce.(?!)
I am pushing a very simple csv and it works.
any way, thanks for your reply
…On Wed, Dec 2, 2020 at 4:30 PM David Roberts ***@***.***> wrote:
@cspielmann <https://github.com/cspielmann> are you complaining that the
entire feature has disappeared, or specifically that it doesn't work for
semi-structured log files without timestamps?
I believe the whole feature was accidentally made inaccessible on a basic
license for one minor release and then fixed in the following patch
release. There will be a separate issue for that somewhere if that's the
problem you've got.
Only highly structured formats like CSV and NDJSON are accepted without
timestamps. That has always been the case. We could do an enhancement for
semi-structured log files without timestamps, but that has never been
demonstrated in a video as it has never worked. So please be more specific
about exactly what doesn't work for you.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#63526 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADRWUIBMOO3DI7JEJRMXU4TSSZMPRANCNFSM4MIB6GTQ>
.
|
No problem, I didn't think you were being mean, it's just that I wasn't completely clear what didn't work for you. It seems that you explained it in https://discuss.elastic.co/t/upload-csv-file-without-timestamp-to-kibana-with-ml-fails/257376. What happened is that there was something about your CSV file that failed to upload that meant the file structure finder didn't think it was CSV. As a result, it tried to analyse it as semi-structured text, and currently that only works when a timestamp can be detected. So, the next question is, why wasn't your CSV file recognized as a CSV file? There are a few possible reasons:
If it is reason 2 or 3 then you should upgrade to 7.10 where you will be able to take advantage of elastic/elasticsearch#55735 and #74376. When the initial analysis fails due to one of those reasons you'll be able to go to the overrides flyout and tell it that your file is CSV, and then up to 10% of the rows will be allowed to have a column count that's inconsistent with the header row and it will still be imported as best it can be. The other benefit of upgrading is that you'll get the explanation of why it wasn't considered to be CSV, for example, "row 375 had 19 columns whereas the header had 17". This is really hard to spot by eye in a big text file (although easier in a spreadsheet program). |
I experienced a similar issue. At the end it turned out the the CSV data was corrupt, because of the chosen comma separator. Exporting the dataset using a semicolon solved the issue. The error message seems to be just a bit misleading. A timestamp is not needed - as already described in comments above. |
Kibana version: 7.7.0 BC6
Elasticsearch version: 7.7.0 BC6
Server OS version: Windows 2012 Server
Browser version: Chrome (also IE11)
Browser OS version: Windows 10
Original install method (e.g. download page, yum, from source, etc.): zip files default distribution
Describe the bug: If Data Visualizer should accept files without timestamps, its not working in this version.
Steps to reproduce:
Import a CSV, NDJSON, or log file
xpack-ascii.txt
Error on screen:
File could not be read
Bad Request: [illegal_argument_exception] Could not find a timestamp in the sample provided
Expected behavior: It should ingest the data
Screenshots (if relevant):
Errors in browser console (if relevant):
Provide logs and/or server output (if relevant):
Any additional context:
The text was updated successfully, but these errors were encountered: