oobaloo.co.uk
pingles
http://oobaloo.co.uk/tag/hadoop
Blog of Paul Ingles. Laquo; Back to blog. Introducing Bifrost: Archive Kafka data to Amazon S3. We're happy to announce the public release of a tool we've been using in production for a while now: Bifrost. We use Bifrost to incrementally archive all our Kafka data into Amazon S3; these transaction logs can then be ingested into our streaming data pipeline (we only need to use the archived files occasionally when we radically change our computation). From Pinterest and Kafka's old hadoop-consumer. If you ...
cascading.org
Driven for Cascading | Cascading
http://www.cascading.org/2014/02/14/driven-for-cascading
Written February 14th, 2014. We’re happy to announce that a free public beta for Driven. Takes Cascading application development to the next level with management and monitoring capabilities for your Cascading apps:. Monitor your enterprise data apps built with Cascading (including Scalding, Cascalog, Lingual, Pattern and other DSLs). Quickly identify failed and poorly performing apps. Visually see your data application execute. Get started with Driven:. Http:/ forums.cascading.io. Sign-up for Updates *.
cascading.org
Fluid 1.0 | Cascading
http://www.cascading.org/2014/12/08/fluid-1-0
Written December 8th, 2014. We are happy to announce that Cascading Fluid 1.0 is now publicly available. Http:/ www.cascading.org/fluid. Fluid is an API library exposing the Cascading. Library as a Java fluent interface. And mirrors all of the Cascading concepts without introducing new ones. The Fluid API is generated directly from Cascading compiled libraries and supports all currently supported Cascading final and WIP releases, including Cascading 3.0 WIP. Which provides support for Apache Tez.
cascading.org
Extensions | Cascading
http://www.cascading.org/extensions
The Cascading ecosystem is filled with support for a variety of programming languages, data sources, serializers and tools that extend the functionality of Cascading applications. These extensions are available for use with Cascading and are contributed code from both Concurrent and the Cascading community. Many new projects are actively available through Cascading GitHub. And the Conjars Maven jar repository. From Concurrent, the proven framework for building enterprise data applications. H2 data source...
cascading.org
SDK | Cascading
http://www.cascading.org/sdk
The Cascading Software Development Kit. The fastest way to get started with Cascading and to install Cascading based tools. The SDK includes Cascading 2.7 and related projects in a single archive. Detailed Documentation and GitHub Repository. Tools and Related Projects. See the compatibility chart. For more details. If your distribution is not listed, please contact the vendor to see if they are certifying their distribution for use with Cascading. See the section Redistribution. In the project README.
cascading.org
Fluid | Cascading
http://www.cascading.org/fluid
A Fluent Java API for Cascading. Fluid is an API library exposing the Cascading library as a Java Fluent API. And mirrors all of the Cascading concepts without introducing new ones. The Fluid API is generated directly from Cascading compiled libraries and supports all currently supported Cascading final. Including Cascading 3.0 WIP. Which provides support for Apache Tez. Source code on GitHub. Cascading for the Impatient, Part 6: Fluid example. Vs Original Cascading example. How to Get Started. All Fluid...
cascading.org
Support | Cascading
http://www.cascading.org/support
For general questions and troubleshooting visit the mail list, chat up the developers on IRC, or do it yourself with the source. On irc.freenode.org. GitHub Read the Source. Concurrent offers commercial support and training, and has partnered with a number of consulting companies with vast experience with Cascading and Apache Hadoop. Cascading Enterprise Developer Training. Chart for a list of current compatible and supported Hadoop distributions. Sign-up for Updates *.
cascading.org
Pattern | Cascading
http://www.cascading.org/projects/pattern
Cascading Pattern is an extension to Cascading that provides various machine learning scoring algorithms and a utility for translating Predictive Model Markup Language (PMML) documents into applications on Apache Hadoop. Now you can deploy predictive models on to Hadoop or utilize the Cascading Pattern Java API to deploy your models or sophisticated ensembles. Build Machine Scoring Applications. Quickly deploy machine scoring applications at scale on Apache Hadoop in as little as 4 lines of code. Depende...
cascading.org
December 2014 | Cascading
http://www.cascading.org/2014/12/02/cascading-newsletter-december-2014
Written December 2nd, 2014. Sign-up for Updates *. This field is for validation purposes and should be left unchanged.
cascading.org
April 2015 | Cascading
http://www.cascading.org/2015/04/13/april-2015
Written April 13th, 2015. Sign-up for Updates *. This field is for validation purposes and should be left unchanged.