-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Docs] Add caveat about autodetect_column_names #79
Comments
Note that the new cvs codec will help for the cases of changing files with new headers. |
Same behaviour with the csv codec:
input{ Here is the exception for a new header line: Data from the new file is handled in the previous format, which of course leads to inconsistencies. |
@colinsurprenant - could you elaborate on why you think the csv codec might address the case of changing files with new headers? |
https://www.elastic.co/guide/en/logstash/current/plugins-filters-csv.html#plugins-filters-csv-autodetect_column_names
When using
autodetect_column_names
, if eitherthen the behaviour is not what might be expected.
In the first case, the column names will be re-read from the next event where LS left off before being stopped - so the event data in that row becomes the column names for the rest of the file.
In the second case, column names are not re-read on starting a new file, so the data in the new file is treated as if it were in the format of the previous file.
Additionally, I think in the second case, the header line will be ingested as data, even if it is column names.
We should add a caveat in the docs to cover these scenarios.
Something along the lines of a note like:
The text was updated successfully, but these errors were encountered: