Apache Hadoop

Apache Hadoop 2.7.2 updated

Apache Hadoop was initially developed by Yahoo and the project is a combination between the previous Apache Hadoop Core and Apache Hadoop Common repos. The Hadoop project has gained a lot of notoriety thanks to its great results in implementing a...

Apache POI

Apache POI 3.13 updated

Apache POI was designed to work with the Office Open XML standards (OOXML) and Microsoft's OLE 2 Compound Document format (OLE2). Using POI, developers can read and write MS Excel files, MS Word files, MS PowerPoint files and MS Outlook messages...

Apache Chukwa

Apache Chukwa 0.7.0 updated

Apache Chukwa can be used to log the activity that happens into a complex Hadoop distributed computing system. Chukwa will help developers track down problems and solve bugs in the data processing applications, by providing a toolkit for not only...

Apache Avro

Apache Avro 1.8.0 updated

Apache Avro provides a way to serialize data using JSON schemas. These schema files are always present with the data, allowing developers to ship both the data and its structure to any application that may need it. Because of this data can be read and...

Apache NiFi

Apache NiFi 0.6.0 updated

Apache NiFi was created for distributed computing systems where data is processed on multiple servers before being sent to the user or to a storage container. While this might seem as a very complicated and time-consuming task, NiFi comes with a Web-based...

Apache Cassandra

Apache Cassandra 3.5.0 / 2.2.6 updated

Apache Cassandra was initially built by Facebook for their own servers and services, using principles developed by Google and Amazon for BigTable and DynamoDB. Currently the project was donated to the Apache Foundation who continues to developed it...