The TIBCO ActiveMatrix BusinessWorks Plug-in for Big Data facilitates the connectivity between your Enterprise Service Bus (ESB) and Hadoop (Apache Hive and Apache Pig).
The Hive connection works similarly to a JDBC connection for relational databases. Configure the connection and add the activity with the particular SQL statement you want to execute. The Hive activity will provide you input, result, error handling and compensation feature. All MapReduce* code is generated automatically under the hood. To maximize the flexibility and control it is also possible to write your own MapReduce java code and incorporate this into your ActiveMatrix BusinessWorks service.
MapReduce is an opensource framework developed and introduced by Google for the execution of massive amount of calculations within a short period of time. It forms the heart of Hadoop and is most known for its massive scalability options on commodity hardware.
The ActiveMatrix BusinessWorks Plug-in for Big Data will add a new palette called “Hadoop” with the following options:
Hive = The Hive activity is used to facilitate querying and managing large datasets;
MapReduce = The Mapreduce activity is used to create and queue a standard Mapreduce job or a streaming Mapreduce job;
Pig = The Pig activity is used to create and queue a Pig job;
WaitForJobCompletion = The WaitForJobCompletion activity is used to wait for the specified jobs to complete;
By using the TIBCO ActiveMatrix BusinessWorks Plug-in for Big Data we solve a key challenge to integrate the input and result of Hadoop into the rest of the enterprise. But this is just one of the many challenges of Big Data.