: A lightweight web server that allows for remote execution and monitoring of transformations and jobs. Key Concepts: Transformations vs. Jobs
: The primary graphical user interface (GUI) used to design and test transformations and jobs through a drag-and-drop environment. Pentaho Data Integration Beginner’s Guide
: A command-line tool specifically for executing transformations. Kitchen : A command-line tool used to execute jobs. : A lightweight web server that allows for
: It supports data extraction from numerous sources, including relational databases, Excel, XML, Hadoop, and Amazon S3. PDI utilizes a suite of tools, collectively often
PDI utilizes a suite of tools, collectively often referred to by their original names (the "Kettle" project components):
: A common first step involves creating a simple transformation to read a file, apply a basic change (like splitting a name field), and output it to a new format.
: Focused on high-level orchestration and flow control. They coordinate transformations and other job entries (like sending an email or checking if a file exists) in a sequential manner. Primary Features and Benefits