Skip to content

Virtual Machine CHANGELOG

Phil Hagen edited this page Dec 17, 2024 · 8 revisions

This page includes major changes implemented with each new virtual machine release. Most changes do not require a new virtual machine release, as they can often be added in-place with the built-in updater process. However, system-level changes such as upgrades to major components, fundamental changes in the platform's setup, etc, will only be available with a new virtual machine. Each new virtual machine released is paired with a git branch named public/v*, where * is the date of release. For example: public/v20241217.

  • Update: 2024-12-17: Ubuntu 24.04 base distribution
    • It's here! This is the first release built on Ubuntu, which will be the standard base distribution going forward
      • All Ansible build scripts and documentation have been updated to match this
      • Largely, the SOF-ELK functionality itself is the same as v2024-10-11
      • Notable missing pieces compared to the previous release before:
        • Cerebro, domain_stats, and Elastalert have been removed, at least for now. Hopefully, these will be re-added in the future when it's more feasible to incorporate them to Ubuntu 24.04
    • There are now TWO VMs - one for x86 processors and (drumroll) one for ARM processors!!!
      • THe Ansible build process will work for either architecture
    • Updated all Elastic Stack components to 8.17.0
    • All operating system updates as of 2024-12-17 have been applied
    • Several dashboards have been updated to reflect the ECS naming from the previous release
    • The Introduction dashboard has been streamlined and now better reflects all data loaded to Elasticsearch via an all-records Kibana data view that includes all indices
    • All sources of Windows Event Log data (Snare, Plaso, and KAPE) are stored in the evtxlogs index so they can be addressed together in the dashboard or Discover application
  • MAJOR Update: 2024-10-11: All components updated to use ECS-based fields
    • NOTE: This will be the LAST update to use CentOS as a base distribution. Future release will be based on Ubuntu LTS. (See #292).
    • Updated all Elastic Stack components to version 8.15.2
    • ALL parsers, dashboard components, ES templates, scripts, and more have been updated to reflect Elastic Common Schema (ECS) structure.
      • All fields are documented in the /docs/ecs_fields.csv file, which is the "Rosetta Stone" for any field handled in the overall configuration.
    • Updated nfdump to 1.7.4 (Partially for #287)
    • Increased Logstash's thread stack size to 4MB (#299)
    • Parser updates:
      • Zeek JSON-formatted logs (support for TSV-formatted logs has been removed):
        • conn.log
        • dns.log
        • files.log
        • ftp.log
        • http.log
        • notice.log
        • ssl.log
        • weird.log
        • x509.log
      • Additional GWS log types
      • Fixed Azure log parsing
      • Significantly expanded parsing for Snare logs, with the goal to match events shipped with winlogbeat as closely as possible.
      • Align KAPE and Plaso parsers to match Snare and therefore winlogbeat as closely as possible.
      • Calculate and store Community ID for any event that has the necessary fields to support it.
    • Filebeat now used for NetFlow input
    • Moved to Kibana Lens objects where possible.
    • Includes new csv2json.py script to help migrate away from any CSV-based input data. Preprocessing CSV into JSON will be the only supported input method in the future.
    • Moved documentation-type content (such as READMEs, HOWTOs, and this changelog) to GH Wiki
    • Merged / and /home/ filesystems to make loading large data files easier (#258).
    • All CentOS 7 updates applied as of 2024-11-10
    • So, so, so much more has been integrated to this release! (If you really want to see the entirety of changes, run git diff 'public/v20230623' from the /usr/local/sof-elk/ directory on the VM.)
  • Update: 2023-06-23: Fixes a few errors that slipped through previous day's release
    • Fixes file naming and cleanup of pre-staged GeoIP databases
    • Optimizes Elasticsearch outputs into one stanza
    • Moves all *.asnstr fields to runtime field mappings
  • Update: 2023-06-22: Upgraded to ES 8.8.1 components, ships with outdated but usable GeoIP database files
    • Updated to all ES 8.8.1 components
    • All CentOS 7 updates applied as of 2023-06-23
    • Ships with the final version of MaxMind GeoIP database files from the old license structure that doesn't require an account (#279 and #280)
    • Notify on login when a new public/community edition VM is released (#159)
  • MAJOR Update: 2023-03-29: Upgraded to ES 8.6.2 components, upstream parsers
    • Updated to all ES 8.6.2 components, including Ansible playbooks, configuration files, APIs, etc.
    • All CentOS 7 updates applied as of 2023-03-29
    • Moved all ES templates to composed templates instead of legacy ones
    • Elastalert has been removed because it is not compatible with the newest ES components and Elastalert2 is not yet usable
    • Added specific login screen for Ansible-built instances
    • ELB log format from bedangSen (#262)
    • Initial parser for k8s logs with help from jamesjroddie (#268)
    • Updated geoipupdate to 4.10.0 due to old license key format deprecation
  • Update: 2022-10-25: Upgraded to ES 7.17.7 components, integrated upstream parsers
    • Updated all Elastic Stack components to 7.17.7
    • All CentOS 7 updates applied as of 2022-10-25
    • Updated and added to cloud platform parsers
    • Parser and dashboard improvements
  • Update: 2021-10-06: Upgraded to ES 7.15.0 components, added numerous cloud platform log parsers
    • Updated all Elastic Stack components to 7.15.0
    • Dashboards adjusted for ES 7.15.0
    • All CentOS 7 updates applied as of 2021-10-06
    • LOTS of cloud platform logs are parsed. Various AWS, GCP, and Azure logs are all handled. Several of these require helper scripts (aws-cloudtrail2sof-elk.py, aws-vpcflow2sof-elk.sh, and azure-vpcflow2sof-elk.py). More documentation on these is pending.
    • Added cloud CLI libraries for AWS, GCP, and Azure
    • All dashboards default to KQL instead of Lucene
    • Better handling of GeoIP database download, installation, and updates
    • Numerous other parser and dashboard improvements
  • Update: 2020-03-27: Upgraded to ES 7.6 components, more
    • Updated all Elastic Stack components to 7.6.1
    • VM now built completely from the included Ansible playbooks
    • All CentOS 7 updates applied as of 2020-03-27
  • Update: 2019-06-06: Upgraded to ES 6.7.2 components, KAPE, and more
    • Updated all Elastic Stack components to 6.7.2. This will be the last update with v6.x.
    • NetFlow v9 handling in both live collector and archived ingestion modes - including all IPv6 addresses, of course!
    • Processing of KAPE output data! This is huge. The Kroll Artifact Parser and Extractor (KAPE) is a free suite of tools written by Eric Zimmerman. Place the JSON output files into /logstash/kape/ to be parsed. SOF-ELK currently parses KAPE's MFT and .LNK file output, with more types forthcoming. Dashboards are thin with this release, but the "Discover" tab is particularly helpful for these data types and more dashboard support will be released via the Github-based internal updating process soon.
    • Lots of internal refactoring to make the background system management procedures better
    • Elasticsearch shard count is now forced to 1 (which matches the default in ES7), and replica count is set to zero by default. These should improve performance somewhere between a little and a lot. To override replica count, edit the /etc/sysconfig/sof-elk file.
  • Update: 2019-01-02: Updated to ES 6.5.3 components
    • Updated system components to latest CentOS 7.6
    • Updated all Elastic Stack components to 6.5.3
    • Numerous other config file, parser, and dashboard updates as documented in the Git history
  • Update: 2018-09-18: All-new with 6.4.1
    • VM was rebuilt entirely from scratch with all Elastic stack components at v6.4.0
    • Updated system components to match latest CentOS 7.5
    • Rebuilt and revalidated all Logstash parsers against latest syntax
    • Total overhaul of most supporting_scripts files to address latest Elasticsearch and Kibana APIs
    • Latest versions of domain_stats and freq_server
    • Better handling of dynamic (boot-time) memory allocation for Elasticsearch
    • Moved to MaxMind GeoIP2 format (required by this version of Logstash)
    • Updated several Logstash parsers to handle new fields, correct field types, etc.
    • Rebuilt all Kibana dashboards to handle updated index mappings and field names
    • IPv6 addresses can now be handled as IPs instead of strings
  • Update: 2017-05-18: Another MAJOR update!
    • Daily checks for upstream updates in the Github repository, with advisement on login if updates are available
    • Added dozens of parser configurations from Justin Henderson, supporting the SANS SEC555 class
    • Added experimental Plaso data ingest features, contributed by Mark McCurdy
    • Added freq_server and domain_stats as localhost-bound services
    • Added HTML visualization type to Kibana
    • Dynamically calculate ES_HEAP_SIZE
    • Uses a locally-running filebeat process for file ingest instead of logstash's file inputs
    • Replaced es_nuke.sh with sof-elk_clear.py, adding by-file/by-directory clear and optional reload from disk
    • Overall file cleanup
    • Enforce field naming consistency
    • Various updates to dashboards and parsers
    • Ingest locations changed to /logstash/*/
    • Increased VM's default RAM to 4GB
  • Update: 2016-07-08: This is a MAJOR update!
    • Complete overhaul of the VM
    • Re-branded to SOF-ELK®, with awesome logo to boot
    • Now uses CentOS 7.x as a base OS
    • Latest releases of the ELK stack components
    • All dashboards re-created to work in Kibana 4
    • Optimized Logstash parsing
    • Better post-deployment upgrade path
    • Far more than can be adequately documented here - see the git logs for all the details
    • No longer includes the Xplico application
  • Update: 2015-09-18
    • Adjusted sample source evidence to correspond to the UTC change
    • Re-configured Xplico to start on boot (not Logstash-related)
  • Update: 2015-07-16
    • Force permanence for UTC (seriously, why doesn't replacing /etc/localtime last through an update to the tzdata package?!)
    • Updated all packages
  • Update: 2015-05-05
    • Uses separate ElasticSearch instance instead of Logstash's embedded version (also disabled Groovy scripting engine to address CVE-2015-1427)
    • Uses github-based configuration files, grok patterns, and dashboards (tag 2015-05-05)
    • Updated all packages
  • Update: 2015-03-21
    • Corrects that somehow the time zone was set to US Eastern instead of UTC, which is unforgivable
  • Update: 2015-03-20
    • Modified the embedded ElasticSearch server configuration to mitigate CVE-2015-1427 by disabling the Groovy scripting engine.
  • Update: 2015-01-01
    • Modified the Kibana configuration so the for572.json dashboard is REALLY loaded as the default.
    • Updated all packages
  • Update: 2014-12-18
    • Corrected a Squid log parsing error
    • corrected the Kibana "Home" dashboard to be the FOR572 summary/instruction dashboard
    • Updated all packages
  • Update: 2014-12-03: This is a MAJOR update!
    • Parsing has been updated and is far more robust. The VM cleanly ingests all known data, but let me know if there is any standard data that's not parsed correctly.
    • The VM can ingest live NetFlow v5 by opening UDP port 9995 in the firewall with the following commands using sudo:
      • fw_modify.sh -a open -p 9995 -r udp
    • Archived NetFlow (e.g. ASCII output from nfdump) is ingested just as live NetFlow, so you can load historical evidence alongside live data, and use the same dashboard to examine both simultaneously. See the "Ingesting Archived NetFlow" section for details on this process.
    • Cisco ASA events sent via syslog are fully parsed
    • Much, much more!