Apache Hive

Apache Hive 1.2.1 updated

Apache Hive was first developed as a Apache Hadoop sub-project for providing Hadoop administrators with an easy to use, proficient query language for their data.Because of this, Hive was developed from the start to work with huge amounts of information...

Apache Accumulo

Apache Accumulo 1.7.0 updated

Apache Accumulo is a mashup of various technologies, from Google's BigTable, to Apache's Hadoop, Thrift and Zookeeper.Compared to Google's BigTable system, Accumulo features a few improvements of its own.These include table cell-based access restrictions,...

Apache Pig

Apache Pig 0.15.0 updated

Apache Pig has spawned from the Apache Hadoop project and is one of its modules that where in charge of providing a way to analyze the data it processed and stored.Pig uses a custom query language called "Pig Latin" which is incredibly easy to learn and...

Apache UIMA

Apache UIMA 2.8.1 updated

Apache UIMA stands for Unstructured Information Management Applications.UIMA is an interesting project from the Apache Foundation that tries to analyze and detect information blocks and patterns inside large unorganized data.As an example to what UIMA...

Apache Slider

Apache Slider 0.80.0 updated

Apache Slider targets Hadoop environments and is based on the database's next generation MapReduce 2.0 standard, also known as YARN. Slider can be used to create YARN-compliant applications that interact with the underlying Hadoop database or database...

Apache CXF

Apache CXF 3.1.4 updated

Services developed via Apache CXF can speak to a wide variety of protocols, allowing developers to build multi-functional applications that can work over a variety of platforms and transport layers. Apache CXF is very extensible, is well documented,...

Apache Tez

Apache Tez 0.8.2 updated

Apache Tez makes it possible to create complex data processing workflows and jobs inside Hadoop databases, serving stored data in a "pre-computed" form, ready to be used as soon as its outputted. Tez takes data stored in various nodes in Hadoop, processes...