Skip to content

Latest commit

 

History

History
104 lines (73 loc) · 4.24 KB

README.md

File metadata and controls

104 lines (73 loc) · 4.24 KB

logsearch-for-aws

A BOSH release to help you ingest, parse, and visualize your AWS logs.

About

We can currently handle these log types...

We rely a process flow which looks like...

  1. AWS dumps a new log into your S3 bucket
  2. Your S3 bucket sends a notification to a SQS topic about it
  3. The s3-notification job waits for messages on that SQS topic
  4. It downloads the log file from S3 and normalizes the contents
  5. Events get pushed into the redis queue
  6. Your regular logsearch cluster takes care of the rest

We like this setup because...

  • can continue adding support for all the AWS log formats
  • can reuse our existing logsearch infrastructure for parsing and searching log events
  • using a queue is much more scalable than watching entire bucket paths for new files

Usage

Before getting started, you must set up the following in AWS...

  • IAM credentials which allow working off the queue and downloading the referenced S3 files (sample)
  • Separate SQS Queues and S3 Notifications for the log types and files you use (sample)

Fetch and install our logsearch configuration (details)...

$ wget -O- https://logsearch-for-aws-boshrelease.s3.amazonaws.com/release/latest/logsearch-config.tgz | tar -xz
$ open \
  logsearch-config/logs/elasticsearch-mappings.json \
  logsearch-config/logs/logstash-filters.conf
...snip...

Upload the release to your director...

$ bosh upload release https://logsearch-for-aws-boshrelease.s3.amazonaws.com/release/latest/tarball.tgz

Update your deployment to add the release, templates, and properties...

releases:
  ...snip...
  - name: "logsearch-for-aws"
    version: "latest"
jobs:
  - name: "l4aws"
    templates:
      ...snip...
      - release: "logsearch-for-aws"
        name: "s3-notification"
properties:
  ...snip...
  l4aws:
    access_key_id:     "...snip..."
    secret_access_key: "...snip..."
    s3_notification:
      queues:
        # [ queue region , queue name         , log format type ]
        - [ "us-east-1"  , "l4aws-billing"    , "billing"       ]
        - [ "us-east-1"  , "l4aws-cloudtrail" , "cloudtrail"    ]
        - [ "us-east-1"  , "l4aws-s3"         , "s3"            ]

Implementing a New Log Type

If you want to add parsing for a new log type... here are the things you should keep in mind...

  1. Update s3-notification job templates...
    1. config/logstash.conf.erb - add it to the list used to create file inputs
    2. bin/main_ctl - add it to the list used to mkdir directories
    3. logsearch/logs.yml - add a dummy entry to ensure the logsearch-config reference is active
  2. Create src/scripts/transform-{aws-log-format-name} which will convert the raw S3 log file format into a single event per line.
  3. Implement the filtering configuration in src/logsearch-config/logs/{aws-log-format-name}...
    1. Write {aws-log-format-name} to name
    2. Write your test data and expectations to expected.testdata
    3. Write your logstash filter configurations to logstash-filters.conf
    4. Write your elasticsearch mapping configurations to elasticsearch-mapping.json
  4. Update src/aws-helper/s3-sns-sqs/generator/regenerate.sh (and then execute) to add the new log type.
  5. Update src/aws-helper/iam/sample-policy.json to add the new sample directory (if applicable).
  6. Update README.md to add the log file format to our list of supported logs.

Once updated, run ./bin/logsearch-config to test your log parsing filters and generate new configuration in ./logsearch-config. Use those configuration files in your test environment and verify your new log runs through the whole process.

Share your work with a Pull Request :)

Additional Resources

  • Replay Helper - (fairly) easy way to replay some old logs into the queues for re-analysis

License

Apache License 2.0