Top Database Tools Scripts for Apache Software Foundation
Apache Derby is one of the first database engines the Apache Software Foundation hosted under its umbrella, before big names like Yahoo's Hadoop, Facebook's Cassandra, or IBM's CouchDB were also donated to it.
Coded from the beginning to...
Apache Ambari allows database administrators to install, manage and monitor Hadoop clusters.
Metrics are collected via Ganglia, while email alerts are sent out via Nagios.
Ambari is basically a Web-based administration GUI for Hadoop, a tool that...
Apache HBase was modeled after Google's internal BigTable database system and was built atop the famous Apache Hadoop data processing framework and distributed computing engine.
Because it was built on Hadoop, some of its cool features are also inherited...
Apache OpenJPA comes in two separate branches, each production-level material. The difference between the two is the standard each implements. The 1.x branch follows the JSR-220 Enterprise Java Beans 3.0 specification, while the 2.x branch was modeled...
Apache Commons DBCP stands for Apache Commons Database Connection Pooling. This package implements the connection pooling technique, a system that caches database connections so users won't need to reconnect to the database every time...
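To make the idea of connection pooling concrete, here is a minimal sketch in Python. It is not the DBCP API (DBCP is a Java library). The `ConnectionPool` class, its `opened` counter, and the use of an in-memory SQLite database are all assumptions chosen only so the example runs on its own.

```python
import queue
import sqlite3


class ConnectionPool:
    """Hypothetical toy pool: hands out cached connections instead of opening new ones."""

    def __init__(self, database, size=2):
        self._idle = queue.Queue(maxsize=size)
        self.opened = 0  # how many real connections were ever created
        for _ in range(size):
            # check_same_thread=False allows a pooled connection to be reused across threads
            self._idle.put(sqlite3.connect(database, check_same_thread=False))
            self.opened += 1

    def acquire(self, timeout=None):
        # Blocks until an idle connection is available (or the timeout expires)
        return self._idle.get(timeout=timeout)

    def release(self, conn):
        # Return the connection to the pool so the next caller can reuse it
        self._idle.put(conn)

    def close(self):
        # Close every idle connection; call once the pool is no longer needed
        while not self._idle.empty():
            self._idle.get_nowait().close()


pool = ConnectionPool(":memory:", size=2)
results = []
for _ in range(5):
    conn = pool.acquire()
    try:
        results.append(conn.execute("SELECT 1").fetchone()[0])
    finally:
        pool.release(conn)
```

Five queries are served here while only two connections are ever opened, which is the saving a pool is meant to provide. A production pool would also validate, expire, and replace connections, which this sketch leaves out.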
Apache Slider targets Hadoop environments and is based on Hadoop's next-generation MapReduce 2.0 framework, also known as YARN. Slider can be used to create YARN-compliant applications that interact with the underlying Hadoop database or database...
Apache Empire-db functions like a Java ORM but it provides extra support for basic data persistence methods.
This implicitly means better-targeted SQL statements that return query results more efficiently.
Empire-db is easy to use, not requiring...
Apache Parquet is a "columnar" data storage format that was specifically created for the Apache Hadoop family of projects.
Parquet is recommended for large data sets, mainly because it uses a complex data compression system, relying on a series of...
A pipeline is a concatenation of operations to perform a specific job, arranged so that the output of each element is the input of the next. Apache Crunch provides an easier method of dealing with Apache Hadoop MapReduce pipelines. Crunch simplifies this...
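The pipeline definition above can be illustrated with a short Python sketch. It does not use Crunch or Hadoop. The `pipeline` helper and the three stages (`tokenize`, `count`, `rank`) are hypothetical names that only show how each stage's output becomes the next stage's input.

```python
from functools import reduce


def pipeline(*stages):
    """Chain stages so that each one receives the previous stage's output."""
    def run(data):
        return reduce(lambda value, stage: stage(value), stages, data)
    return run


def tokenize(lines):
    # Stage 1: split lines into lowercase words
    return [word.lower() for line in lines for word in line.split()]


def count(words):
    # Stage 2: count the occurrences of each word
    counts = {}
    for word in words:
        counts[word] = counts.get(word, 0) + 1
    return counts


def rank(counts):
    # Stage 3: sort by descending count, then alphabetically
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


word_count = pipeline(tokenize, count, rank)
ranked = word_count(["Apache Crunch", "apache Hadoop"])
```

Here `ranked` places `("apache", 2)` first, followed by the two words that appear once. Adding or reordering stages changes only the call to `pipeline`, not the stages themselves.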
Apache Gora aims at providing a common ground on which developers can create tools that can handle large quantities of information, regardless of their underlying storage unit.
Gora is written in Java and was created as an alternative to other Java ORMs...