Services in many languages. Data intelligence: drill: apache email list drill is a low latency sql query engine for hadoop and nosql. Feature: • agility • flexibility • familiarity. Apache drill mahout: apache mahout is a scalable machine learning library designed to build predictive analytics for big data. Mahout has an implementation of apache spark to speed up memory computing. Feature: • collaborative filtering. • classification • clustering • dimensionality reductionapache mahout data integration: apache sqoop: apache sqoop apache sqoop is a tool designed for bulk data transfer between relational databases and hadoop. Feature: • import and export to and from hdfs.
• import and export to and from hive. • import and export to hbase. Apache flume: flume is a distributed, reliable and available service for efficiently collecting, aggregating, and moving large amounts of log data. Feature: • healthy • fault tolerance • simple and flexible architecture based on streaming data flow. Apache waterway apache chukwa: a scalable log collector used to email list monitor large distributed file systems. Feature: • can be expanded to thousands of nodes. • reliable delivery. • you need to be able to store your data indefinitely. Apache chukwa management, monitoring, orchestration:apache ambari: ambari is designed to simplify hadoop management by providing an interface for provisioning, managing, and monitoring apache hadoop clusters. Feature: • provision a hadoop cluster.
• manage hadoop clusters. • monitor your hadoop cluster email list . Apache ambari apache zookeeper: zookeeper is a centralized service designed for maintaining configuration information, naming, providing distributed synchronization, and providing group services. Feature: • serialization • atmicity • reliability • simple api apachezookeeper apache oozie: oozie is a workflow scheduler system for managing apache hadoop jobs. Feature: • a scalable, reliable and scalable system. • supports several types of hadoop jobs such as map-reduce, hive, pig and sqoop. • simple and easy to use. Apache oozie we'll talk more about components in a future article. Stay tuned.: