Wednesday 6 June 2018

Flume 1.4.0 Installation & Configuration


Apache Flume is a distributed, reliable, and available system for efficiently collecting, aggregating and moving large amounts of log data from many different sources to a centralized data store. Apache Flume is not only restricted to log data aggregation. Flume can be used to transport massive quantities of event data including network traffic data, social-media-data, email messages also.



Prerequisites

Java
Hadoop

Download flume tar file

Download flume 1.4.0 using below command on /usr/local directory.
$ sudo wget http://archive.apache.org/dist/flume/1.4.0/apache-flume-1.4.0-bin.tar.gz /usr/local
Unpack apache-flume-1.4.0-bin.tar.gz file
$ cd /usr/local
$ sudo tar -xvf apache-flume-1.4.0-bin.tar.gz
Rename apache-flume-1.4.0-bin folder to flume
$ sudo mv apache-flume-1.4.0-bin flume 

Setting up environment for flume

Edit ~/.bashrc file for set up the flume environment by appending the following lines.
$ sudo nano ~/.bashrc 

export FLUME_HOME=/usr/local/flume
export PATH=$FLUME_HOME/bin:$PATH
Reload the configuration file ~/.bashrc with the following command.
# source ~/.bashrc
Rename flume-env.sh.template file to flume-env.sh.
$ cd /usr/local/flume/conf
$ sudo cp flume-env.sh.template flume-env.sh
Edit the flume-env.sh file
$ sudo nano flume-env.sh
Set value for JAVA_HOME and JAVA_OPTS environment variable with java installation directory.(default this variable are commented. Need to uncomment and set values as per below).
JAVA_HOME=/usr/opt/jdk
JAVA_OPTS="-Xms500m –Xmx1000m -Dcom.sun.management.jmxremote"
Note:If we are going to use memory channels while setting flume agents, it is preferable to increase the memory limit in JAVA_OPTS variable. By default, the minimum and maximum memory values are 100 MB and 200 MB respectively. Better to increase these limits to 500 to 1000 MB respectively.

Change the ownership and permissions of the directory /usr/local/flume
$ sudo chown -R hdfs:hdfs /usr/local/flume
$ sudo chmod -R 755 /usr/local/flume

Check version

Check the version of flume.
$ flume-ng version
Flume 1.4.0
Source code repository: https://git-wip-us.apache.org/repos/asf/flume.git
Revision: 756924e96ace470289472a3bdb4d87e273ca74ef
Compiled by mpercy on Mon Jun 24 18:22:14 PDT 2013
From source with checksum f7db4bb30c2114d0d4fde482f183d4fe
Share:

0 comments:

Post a Comment

Total Pageviews

Lables

Powered by Blogger.

Followers