Quick Start Guide for the Driven Online Trial

version 2.1.4

Overview

Driven is a web-based application performance management platform that helps you visualize the operational details around all phases of your data-processing applications: development, debugging, performance tuning, and operator monitoring.

To enable Driven, you must install the Driven Plugin in your Hadoop environment to process telemetry data from your running applications. You do not need to make any changes to your existing data applications to integrate with Driven.

Tip
The simplest way to get started is to use the online hosted service for Driven without signing up for an account, which lets you try the product anonymously. After you have enabled Driven anonymously and tried some features, complete the hosted registration page to obtain an API key. With the API key, you can configure the Driven Plugin to provide more features, such as performance timelines of application runs.

Prerequisites

  • Your applications must have network access to https://trial.driven.io on the standard HTTPS port, 443. Neither Cascading nor the plugin opens a connection from within the Hadoop cluster. The client side of Cascading is the only Java process that attempts to make a remote network connection.

  • Client environment variables, such as HADOOP_CLIENT_OPTS, YARN_CLIENT_OPTS, and YARN_HEAPSIZE, should provide at least the memory allocated by the following commonly used JVM options. If sufficient memory is available on your system, you might want to allocate more memory to the JVM.

    -Xmx2G -Xms2G -XX:MaxPermSize=256m -XX:+UseParNewGC -XX:+UseConcMarkSweepGC
    -XX:+CMSPermGenSweepingEnabled -XX:+CMSClassUnloadingEnabled
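
To confirm the network prerequisite before running an application, a quick reachability check from the client host can help. The following Python sketch is not part of Driven. The function name can_reach is hypothetical, and the check only confirms that a TCP connection to the given port can be opened. It does not confirm that the plugin can send telemetry data.

```python
import socket


def can_reach(host: str, port: int = 443, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        # create_connection resolves the host name and attempts a TCP connect.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False
```

Run on the machine that launches the client side of Cascading, a call such as can_reach("trial.driven.io") returning False suggests that a firewall or proxy is blocking outbound HTTPS traffic.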

Installing the Driven Plugin

Installing the Driven Agent for Hive and Native MapReduce

To get telemetry data from Hive or native MapReduce applications, install the Driven Agent. See the Driven Agent Getting Started.

Note
If you are monitoring only Cascading applications with Driven, then you do not need to install the agent.

Using Driven

Once plugin installation completes, you can view application performance metrics in Driven. Run your Cascading application as you normally would. When the application starts, a message appears in the console output with a link to view the application on the Driven server:

    ________        __    ________        .__
   /  _____/  _____/  |_  \______ \_______|__|__  __ ____   ____
  /   \  ____/ __ \   __\  |    |  \_  __ \  \  \/ // __ \ /    \
  \    \_\  \  ___/|  |    |    `   \  | \/  |\   /\  ___/|   |  \
   \______  /\_____>__|   /_______  /__|  |__| \_/  \_____>___|  /
          \/                      \/                           \/ ©

  Follow this link to view your application:

  https://trial.driven.io/driven/[your app ID]

After you open the Driven web page for your application, you can return to the same URL at any time to view data about that application.

Each time a Cascading application runs, it is given a random, unique application ID (app ID). This ID identifies that application instance.
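
If you capture console output in log files, you might want to pull the app ID out of the link automatically. The following Python sketch shows one way to do that. It assumes that the link always has the form shown in the banner above and that the app ID contains no whitespace or slashes; neither assumption is confirmed by this guide. The names LINK_PATTERN and find_app_id are hypothetical.

```python
import re
from typing import Optional

# Assumption: the link has the form https://trial.driven.io/driven/<app ID>
# and the app ID is a single path segment without whitespace.
LINK_PATTERN = re.compile(r"https://trial\.driven\.io/driven/([^\s/]+)")


def find_app_id(console_output: str) -> Optional[str]:
    """Return the app ID from the first Driven link found, or None."""
    match = LINK_PATTERN.search(console_output)
    return match.group(1) if match else None
```

The function returns None when no link is present, which is the expected case if the plugin is not installed or the console output was not captured.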

The full banner is not displayed after you configure the plugin with an API key that is generated by registering for a Driven account.

Sign up on the hosted registration page to obtain an API key if you want to use Driven with additional features, such as team-sharing and performance timelines of application runs. See Using a Driven API Key for how to integrate your data applications with these extra features.

Troubleshooting and Getting Support

The Troubleshooting documentation addresses some issues that you might encounter with Driven, such as what to do if the plugin runs out of memory after installation.

Support for Driven is provided via a public forum:

or by email: