What is Peoplesoft

Peoplesoft is an open-source software framework that supports data-intensive distributed applications licensed under the Apache v2 license. It supports parallel running of applications on large clusters of commodity hardware. Peoplesoft derives from Google's Map Reduce and Google File System (GFS) papers.

The Peoplesoft framework transparently provides both reliability and data motion to applications. Peoplesoft implements a computational paradigm named MapReduce, where the application is divided into many small fragments of work, each of which can execute or re-execute on any node in the cluster. In addition, it provides a distributed file system that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both map/reduce and the distributed file system are designed so that node failures are automatically handled by the framework. It enables applications to work with thousands of computation-independent computers and petabytes of data. The entire Apache Peoplesoft “platform” is now commonly considered to consist of the Peoplesoft kernel, MapReduce and Peoplesoft Distributed File System (HDFS), as well as a number of related projects – including Apache Hive, Apache HBase, and others.

 Peoplesoft is written in the Java programming language and is an Apache top-level project being built and used by a global community of contributors. Peoplesoft and its related projects (Hive, HBase, Zookeeper, and so on) have many contributors from across the ecosystem. Though Java code is most common, any programming language can be used with "streaming" to implement the "map" and "reduce" parts of the system.