recentpopularlog in

ianweatherhogg : kafka   49

semantalytics/awesome-kafka
Contribute to semantalytics/awesome-kafka development by creating an account on GitHub.
awesome  kafka 
november 2018 by ianweatherhogg
dharmeshkakadia/awesome-kafka: Everything about Apache Kafka
Everything about Apache Kafka. Contribute to dharmeshkakadia/awesome-kafka development by creating an account on GitHub.
awesome  kafka 
november 2018 by ianweatherhogg
Scaling Blockchains with Apache Kafka – Grid+
I’ve been hearing a lot about the “Internet of Blockchains”, which will likely be the up-and-coming trend of 2018. It seems like many people in the crypto sphere have become enamored with a foregone…
kafka  block  chain 
august 2017 by ianweatherhogg
Testing Topologies in Kafka Streams
Kafka Streams is a deployment-agnostic stream processing library written in Java. Even though Kafka has a great test coverage, there is no helper code for writing unit-tests for your Kafka Streams topologies. I wrote a little helper library Mocked Streams in Scala, which allows you to create lightweight parallelizable unit-tests for your topologies without running a full Kafka cluster neither an embedded one.
spark  kafka  stream  processing  test  topology 
february 2017 by ianweatherhogg
Stateful Streaming in Spark and Kafka Streams
This article is about aggregates in stateful stream processing. It covers two concrete examples in Apache Spark and Apache Kafka.
spark  kafka  stream  processing  analytics 
february 2017 by ianweatherhogg
Spark and Spark Streaming Unit Testing - Passionate Developer
When you develop distributed system, it is crucial to make it easy to test.
Execute tests in controlled environment, ideally from your IDE.
Long …
spark  stream  test  kafka 
february 2016 by ianweatherhogg
Spark and Kafka Integration Patterns, Part 2 - Passionate Developer
In the world beyond batch,
streaming data processing is a future of dig data.
Despite of the streaming framework using for data processing, tight …
kafka  spark 
february 2016 by ianweatherhogg
How to Build a Scalable ETL Pipeline with Kafka Connect
A tutorial on how to use Kafka Connect, together with the JDBC and HDFS connectors, to build a scalable data pipeline in 30 minutes.
kafka  mysql  hadoop  hive 
february 2016 by ianweatherhogg
Questioning the Lambda Architecture - O'Reilly Radar
Nathan Marz wrote a popular blog post describing an idea he called the Lambda Architecture (How to beat the CAP theorem). The Lambda Architecture is an approach to building...
stream  processing  kafka 
february 2016 by ianweatherhogg
Apache Kafka on Docker - Mastering FP and OO with Scala
Apache Kafka has always been high on my list of things to explore, but since there are quite a few things high on my list, Kafka couldn’t …
kafka  docker 
october 2015 by ianweatherhogg
Blog Archive - Random Thoughts on Coding
Blog Archive 2015 Spark and Guava Tables
Oct 09 2015 posted in Hadoop, MapReduce, Scala, Spark Secondary Sorting in Spark
Oct 02 2015 posted in …
blogs  hadoop  spark  kafka  stream  processing  guava 
october 2015 by ianweatherhogg
Putting Apache Kafka To Use: A Practical Guide to Building a Stream Data Platform (Part 1) | Confluent
These days you hear a lot about "stream processing", "event data", and "real-time", often related to technologies like Kafka, Storm, Samza, or Spark's Streaming module. Though there is a lot of excitement, not everyone knows how to fit these technologies into their technology stack or how to put it to use in practical applications. This guide…
kafka  stream  event  processing 
may 2015 by ianweatherhogg
Apache Kafka 0.8 basic training - Verisign
Apache Kafka 0.8 basic training (120 slides) covering: 1. Introducing Kafka: history, Kafka at LinkedIn, Kafka adoption in the industry, why Kafka 2. Kafka cor…
kafka  storm  5*  presentation  slide 
august 2014 by ianweatherhogg
Implementing a real-time data pipeline with Spark Streaming | Chimpler
Real-time analytics has become a very popular topic in recent years. Whether it is in finance (high frequency trading), adtech (real-time bidding), social networks (real-time activity), Internet of things (sensors sending real-time data), server/traffic monitoring, providing real-time reporting can bring tremendous value (e.g., detect potential attacks on network immediately, quickly adjust ad campaigns, ...). Apache Storm…
spark  storm  stream  processing  kafka 
july 2014 by ianweatherhogg
Apache Kafka
Apache Kafka: A high-throughput, distributed, publish-subscribe messaging system.
kafka  distributed  failover  cluster  stream  processing  documentation  5* 
march 2014 by ianweatherhogg

Copy this bookmark:





to read