Apache Fluo (incubating) is an open source implementation of Percolator (which populates Google's search index) for Apache Accumulo. Fluo makes it possible to update the results of a large-scale computation, index, or analytic as new data is discovered. For more information, take the Fluo tour.
When combining new data with existing data, Fluo offers reduced latency when compared to batch processing frameworks (e.g Spark, MapReduce).
Incremental updates are implemented using transactions which allow thousands of updates to happen concurrently without corrupting data.
The core Fluo API supports simple, cross-node transactional updates using get/set methods.
Combine new data with existing data without having to reprocess the entire dataset.
Fluo applications consist of a series of observers that execute user code when observed data is updated.
The Fluo Recipes API builds on the core API to offer complex transactional updates.
Apache Fluo is an effort undergoing incubation at The Apache Software Foundation (ASF), sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.