Home Explore Blog CI



kubernetes

9th chunk of `content/en/blog/_posts/2015-07-00-The-Growing-Kubernetes-Ecosystem.md`
199478562e4c19ca30fc0630ae24bf7d4da1d75f5cc0f0de0000000100000336
![logo.png](https://lh3.googleusercontent.com/0EQQc3sjVbw1cEYVeT0S5rT1iPLEMHteiKlSMDNqw8lNVOf4vG5qE6pVfvmZlRcg-NoOABC-mMcMSdD8ayrmpok0T91N15QqqmH378ydxK1843dcuJdtEsCnr1Y_RQQo-hWrBfI)

 |

[Pachyderm][22] is a containerized data analytics engine which provides the broad functionality of Hadoop with the ease of use of Docker. Users simply provide containers with their data analysis logic and Pachyderm will distribute that computation over the data. They have just released full deployment on Kubernetes for on premise deployments, and on Google Container Engine, eliminating all the operational overhead of running a cluster yourself.  

 |  
|

![](https://lh4.googleusercontent.com/qxQciTVBkyYDWeSgoxtg7InxQuuXsGSLBDfdxJB9Czo71BzQN5bUugLZhQKkERHqWAnkqHIY2VWi2J7g-pGn4V4AzPE0alBksedou78r0KMZm4QqYTN8QYHIMo4RtVmdw90azYw)

Title: Pachyderm: Containerized Data Analytics on Kubernetes
Summary
Pachyderm is a containerized data analytics engine similar to Hadoop but with the ease of Docker. It allows users to provide containers with their data analysis logic, which Pachyderm then distributes over the data. Pachyderm now fully supports deployment on Kubernetes and Google Container Engine, reducing the operational overhead of managing a cluster.