Parsing and Sending Laravel Logs to ElasticSearch using Logstash

Laravel's error and logging features allows us to log application-specific events that might be proved useful in analyzing behaviour of our application. But problem arise when lets say you have 1 GB size of log file, or 10-15 different laravel application each one producing enormous amount of log data, suddenly answering questions like following is a very difficult task:

How many error that is related to PDOException had occurred last weekend?
Compare the amount of Log::warning generated compared to last month
Sort the list of laravel applications by number of Log::critical recorded in descending order between March 1 to March 15, 2016

Answering to this kind of inquiries might be impossible w/o any sort of tools at hand. Some people actually tries to explicitly record this events in the business logic of Laravel app itself, but that practice is really bad since that weighs down the performance of the application.

It is more ideal for you to accumulate those data in a simple log file and forward it to a background processing server that can then further process the information into a useful form. This way, your application entirely focuses in what it is supposedly doing -- serve http request in the most fastest and efficient manner.

Logstash allows us to process those gigantic log files and break them down into manageable parts. It can also monitor the log files for any new entry and automatically process it. The processed data can then be forwarded to ElasticSearch.

ElasticSearch is a Java-based search engine with analytics capabilities, it allows you to store enormously huge amount of data, then analyze it in a way that is more than anyone could imagine. It also has a built-in REST API, on which you can query the data that you had stored on it, or use Kibana that can allow you to visually build queries instead of hitting the REST API directly. To summarize:

ElasticSearch - for storing, analysis and API of data
LogStash - for parsing and pre-processing of raw log files
Kibana - serves as the user-interface of ElasticSearch so you can visually see the output of ElasticSearch results in different format (tabular, Pie Charts, Historgram etc).

The combination of this three great products is what usually called as ELK Stack.

Once you had forwarded your Laravel log files into ELK stack, searching, analysis and any other form of inquiries is very easy to answer no matter how huge the error log files you have.

Setting up the ELK stack in your local machine

The easiest way to have an ELK stack running in your machine is to setup a vagrant box that has those 3 softwares pre-installed. That way, you don't have to worry about the details of setting those up. Follow these instructions to install it in your machine:

Install VirtualBox
Install Vagrant
Install Git

Once you have those installed, clone the vagrant-elk-box

git clone https://github.com/buonzz/vagrant-elk-box.git
cd vagrant-elk-box
vagrant up

It might take a while for it to complete, since it will download first the basebox and install everything for you. After it had completed, you should have the following available in your local machine:

ElasticSearch http://localhost:9200/
Kibana http://localhost:5601/
Logstash - is a command line tool you can use when you login to the VM via ssh

In production, you should not process logs in your local machine! ELK stack is usually installed in some seperate cluster of servers not facing your end-users (i.e. not in web servers).

Processing an Example laravel.log file

In able to illustrate the process, I created a sample project with sample laravel.log file in GitHub. You first need to clone this inside the vagrant-elk-box folder.

git clone https://github.com/buonzz/logstash-laravel-logs

Take a look at the contents of logstash-laravel-logs/logs/laravel.log
This is basically a kind of file that you will usually see inside storage/logs/ folder of a Laravel 5.2 application. This log file basically simulates different kind of log output you will find in a typical Laravel log file:

Log::emergency
Log::alert
Log::critical
Log::error
Log::warning
Log::notice
Log::info
Log::debug

Usually, you would like to pre-process the logs generated by those into following:

Group and Filter logs by environment (local, production, testing etc)
Group and Filter logs by type (error, emergency, alert, critical,warning, notice, info, debug)
Group by date, month, year. So you can query all logs given a particular date-range
Search by particular keyword. For example, If you want to view all logs that mentions the keyword "QueryException", you should be able to retrieve it.

Parse log files using LogStash

Now, make sure you are inside the vagrant-elk-box folder. Login to the VM by executing

vagrant ssh

Once inside the VM, the logstash command is available to you in this path: /opt/logstash/bin/logstash

The sample project that we had cloned earlier is available inside the VM in this path: /vagrant/logstash-laravel-logs

So, what we are going to do is process the /vagrant/logstash-laravel-logs/logs/laravel.log and load it to ElasticSearch running in http://localhost:9200/

In able for logstash to do anything, it needs a configuration file where it can determine the following things:

Input - what is the source of data to be processed?
Filter - How to parse the input data?
Output - Where to put the processed data?

If you look at the sample configuration file in /vagrant/logstash-laravel-logs/logstash.conf, you will see that we had specified the following:

Input - we specified that we would like for logstash to take the source data from stdin (i.e. we will specify the laravel.log location in command line, more on this later). We also specified a codec called multiline, because we would like to account for log entries that spans multiple lines (stack trace generated by Laravel errors)
Filter - we used grok plugin, to match a certain pattern to look for certain keywords and parse it.
Output - we specified that we would like to store it in ElasticSearch, running on 127.0.0.1, with index name (database name) laravel_logs.

I wont go over that much of the syntax of the logstash configuration file as it requires its own dedicated tutorial. But what is important in here is we understand how we are actually informing logstash that we are giving it a log file that is generated by a Laravel application

Now lets process some log files!
While logged inside the vagrant-elk-box vm, execute the following:

cd /vagrant/logstash-laravel-logs
/opt/logstash/bin/logstash agent -f logstash.conf  < logs/laravel.log

Note: normally, you will not specify the log file via command line as we did above, but as a parameter in the input section of the conf file. We only did it this way, since we only want to process the file for one-time only, then exit. If we instead used a file via conf file, it will continously "watch" the file for any new entry appended to it until you explicitly kill it.

After the script is done executing, you will see a "Logstash shutdown completed". Our log data is now available in ElasticSearch!

Now head on and open a browser in your machine and open http://localhost:9200/laravel_logs/_search?pretty in your browser

Congratulations! you just had successfully processed laravel.log file via LogStash and loaded into ElasticSearch. Here is a sample output that you should be seeing

Few things to note above are

Each log file corresponds to each hits.hits[ Object ] entry in the output
the _index is the index (database) name we specified earlier
the _id is the auto-generated id by ElasticSearch
_source is where our log data went into
_source.message is the original unparsed single entry in the log file (notice how multiline exception is recognized as a single entry, even though they span multiple lines in laravel.log)
_source.timestamp is the date/time value in the log file entry, this will allow us to filter log files by date or date ranges in the future
_source.env is the laravel environment as parsed in the log entry (can be production, local, testing etc). This will allow us to filter logs by environment name
_source.severity is the severity level in the log entry. This will allow us to view only specific group of log type (emergency, alert, critical, error, warning, notice, info and debug)

On the future articles, I will illustrate the power of ElasticSearch and how do you actually make use of this data to build stunning dashboards and view errors that is happening in multiple laravel apps across many servers and many more -- that is, building Kibana dashboards to actually visualize these data into a more comprehensive format.

UPDATE: For some reason, Puppet dependencies of the vagrant-elk-box stopped working. So I have to spin up a quick bash script to provision a simple one, which you can find in here It should be able to get you going on this tutorial. The links and snippets above is updated to use my own vagrant-elk-box.

Did you find this useful?

I'm always happy to help! You can show your support and appreciation by Buying me a coffee (I love coffee!).