recentpopularlog in

ianweatherhogg : storm   33

Apache Storm 0.9 basic training - Verisign
Apache Storm 0.9 basic training (130 slides) covering: 1. Introducing Storm: history, Storm adoption in the industry, why Storm 2. Storm core concepts: topolog…
slide  presentation  storm  4*  topology 
february 2016 by ianweatherhogg
Apache Storm 0.9 training deck and tutorial - Michael G. Noll
An extensive tutorial and training deck on Apache Storm 0.9
storm  slide 
february 2016 by ianweatherhogg
Realtime Trending Analysis with Approximate Algorithms | Mawazo
When we hear about trending, twitter trending immediately comes to mind. However, there are many other scenarios, where such analysis is applicable. Some example  use cases  are 1. Top 5 videos watched in last 2 hours   2. Top 10 news stories browsed in last 15 minutes 3. Top 10 products that users have interacted with…
storm  stream  processing 
february 2016 by ianweatherhogg
How to spot first stories on Twitter using Storm | Michael Vogiatzis
The code is open-source and available on Github. Discussion on Hacker News As a first blog post, I decided to describe a way to detect first stories (a.k.a new events) on Twitter as they happen.  This work is part of the Thesis I wrote last year for my MSc in Computer Science in the University…
storm  twitter 
july 2015 by ianweatherhogg
Stream processing, Event sourcing, Reactive, CEP… and making sense of it all | Confluent
This is an edited transcript of a talk I gave at /dev/winter 2015. Some people call it stream processing. Others call it Event Sourcing or CQRS. Some even call it Complex Event Processing. Sometimes, such self-important buzzwords are just smoke and mirrors, invented by companies who want to sell you stuff. But sometimes, they contain…
stream  processing  event  akka  data  analytics  apache  tool  spark  storm  query  engine  resource  5*  rx 
february 2015 by ianweatherhogg
Apache Kafka 0.8 basic training - Verisign
Apache Kafka 0.8 basic training (120 slides) covering: 1. Introducing Kafka: history, Kafka at LinkedIn, Kafka adoption in the industry, why Kafka 2. Kafka cor…
kafka  storm  5*  presentation  slide 
august 2014 by ianweatherhogg
A Hadoop Alternative: Building a real-time data pipeline with Storm | Chimpler
With the tremendous growth of the online advertising industry, ad networks have to deal with a humongous amount of data to process. For years, Hadoop has been the de-facto technology used to aggregate data logs but although it is efficient in processing big batches, it has not been designed to deal with real-time data. With…
storm  helloworld 
july 2014 by ianweatherhogg
Implementing a real-time data pipeline with Spark Streaming | Chimpler
Real-time analytics has become a very popular topic in recent years. Whether it is in finance (high frequency trading), adtech (real-time bidding), social networks (real-time activity), Internet of things (sensors sending real-time data), server/traffic monitoring, providing real-time reporting can bring tremendous value (e.g., detect potential attacks on network immediately, quickly adjust ad campaigns, ...). Apache Storm…
spark  storm  stream  processing  kafka 
july 2014 by ianweatherhogg
storm-vagrant - Vagrant config to create a virtualized Storm cluster
github  vagrant  storm 
february 2014 by ianweatherhogg

Copy this bookmark:

to read