Start developing with HBase in under 15 min
Easily import your existing data for online use
Store data using complex, evolvable data schemas
What is Kiji?
Kiji is developed on the Hadoop ecosystem, using HBase as its original underlying data store. With Kiji, developers can create a flexible and comprehensive entity-centric schema that enables a 360 degree view of each customer. Data is stored in a rich, compressed, binary Avro format allowing the application to support complex data types. Kiji handles all aspects of serialization and deserialization while maintaining schema metadata, ensuring backwards compatibility as an application’s schema evolves, and captures real-time application interactions.
Most organizations are collecting and storing data in a distributed file systems, such as HDFS, and key value stores, such as HBase, in order to better serve individual customers. However, these tools are very raw, difficult to use, and have no underlying framework for integration. Kiji is the middleware necessary to ingest detail data, stream real-time data, build predictive models and deploy those models on the fly. The various components of Kiji address the common use cases and solve the common challenges experienced by: developers, engineers, analysts and data scientists.
The Kiji Project is modularized into separate components to support a wide range of usage and encourage clean separation of functionality. The Bento Box contains all Kiji modules assembled in a self-contained download. Each module can also be individually downloaded on GitHub.
KijiSchema: simplifies real-time storage and retrieval of diverse data from primitive types to objects, time-series and event streams. KijiSchema handles challenges with serialization, schema design and evolution, and meta data management common in NoSQL storage solutions.
KijiSchema DDL Shell: provides a Data Definition Language that allows for the creation, inspection, and modification of schemas for KijiSchema.
KijiMapReduce: provides a powerful paradigm to apply MapReduce in both batch and real-time workloads. KijiMapReduce introduces producers to perform record-wise analytics and gatherers, which build predictive models by analyzing aggregate behaviors.
KijiMapReduce Library: is a library of helpful examples and useful implementations of MapReduce jobs that can be created within Kiji.
Kiji Hive Adapter: provides HiveQL access to Kiji data through a familiar SQL shell.
Kiji Model Repository: is a library of machine learning tools built on top of KijiExpress.
KijiREST: provides an HTTP REST API for front-end developers to access Kiji data and to trigger model scoring.
KijiScoring: provides the real-time scoring of predictive models within your application.
Download the Kiji source code and start hacking on Kiji! Learn more about Kiji code style here.
Find bugs or suggest a new feature. If you have an idea for a new feature, create a ticket for the feature under the appropriate project.
Submit patches. Learn more about code review here.
Keep up to date with Kiji!Developer Mailing List User Mailing List Issues Mailing List
Kiji User Groups are a great way to learn more about Kiji use cases, best practices for building and deploying real-time Big Data Applications and upcoming Kiji features.
If you are interested in starting a Kiji Meetup in an area near you, let us know.
Ebi Bento Box 2.0.1Download BentoBox 2.0.1
The Bento Box contains all Kiji modules assembled in a self-contained download. The Bento Box comes with a standalone Hadoop and HBase cluster and command-line tools for interacting with Kiji tables and data
Kiji Music Recommender
Build your own recommendation system: