
rdark : distributed   18

aphyr/distsys-class: Class materials for a distributed systems lecture series
This outline accompanies a 12-16 hour overview class on distributed systems fundamentals. The course aims to introduce software engineers to the practical basics of distributed systems, through lecture and discussion. Participants will gain an intuitive understanding of key distributed systems terms, an overview of the algorithmic landscape, and explore production concerns.
aphyr  distributed  github  architecture  systemsw 
march 2019 by rdark
Fun Retro: retrospectives 🔥
Improve continuously with a simple, intuitive and realtime app
scrum  agile  retro  tools  distributed  remote 
april 2018 by rdark
FoundationDB is Open Source
Starting today, FoundationDB starts its next chapter as an open source project!

FoundationDB is a distributed datastore, designed from the ground up to be deployed on clusters of commodity hardware. These clusters scale well as you add machines, automatically heal from hardware failures, and have a simple API. The key-value store supports fully global, cross-row ACID transactions. That's the highest level of data consistency possible. What does this mean for you? Strong consistency makes your application code simpler, your data models more efficient, and your failure modes less surprising.

The great thing is that FoundationDB is already well-established — it's actively developed and has years of production use. We intend to drive FoundationDB forward as a community project and we welcome your participation.
apple  distributed  database  foundationDB 
april 2018 by rdark
Apache NiFi
Apache NiFi supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic. Some of the high-level capabilities and objectives of Apache NiFi include:
apache  data  distributed  open_source 
june 2017 by rdark
zalando/patroni: Runners to orchestrate a high-availability PostgreSQL
There are many ways to run high availability with PostgreSQL. Here, we present a template for you to create your own customized, high-availability solution using Python and — for maximum accessibility — a distributed configuration store like ZooKeeper or etcd.
etcd  zookeeper  clustering  postgres  distributed  python  patroni  kubernetes 
april 2016 by rdark
micro/micro: A microservice toolkit
Micro is a microservices toolkit. It simplifies writing and running distributed applications.
golang  framework  distributed  microservices 
march 2016 by rdark
sailthru/relay: A self-tuning thermostat for distributed systems that minimizes the error between some metric and a target
Distributed systems need thermostats everywhere! Perhaps you need to add workers in proportion to a queue size. Or, in another scenario, you may need to add more AWS nodes when there's a lot of work to do. Perhaps your grid scheduler needs to keep a constant number of jobs running at a time, once per node, but the number of nodes is dynamic. You could use Relay to tune hyper-parameters for online machine learning algorithms. Can you think of any applications? If you can't, look at a couple of time series and you'll come up with good ideas, and there is a good chance that Relay makes solving those quite a bit easier.

In general, Relay is a good candidate for any scenario where you find yourself looking at some metric and then responding to that metric by running some code or tweaking your system.
relay  distributed  control_theory  python 
february 2016 by rdark
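The "thermostat" idea in the Relay entry above can be illustrated with a minimal proportional control loop. This is only a sketch of the general technique, not Relay's actual API: the names `control_step`, `simulate`, and `gain` are hypothetical, and the simulation assumes the adjustment acts on the metric directly and immediately.

```python
def control_step(metric: float, target: float, gain: float = 0.5) -> float:
    """Return a signed adjustment proportional to the error (target - metric).

    Hypothetical helper for illustration; not part of Relay.
    """
    error = target - metric
    return gain * error


def simulate(metric: float, target: float, steps: int = 20, gain: float = 0.5) -> list:
    """Apply the adjustment repeatedly and record the metric after each step.

    Assumes (for simplicity) that each adjustment changes the metric directly.
    """
    history = []
    for _ in range(steps):
        metric += control_step(metric, target, gain)
        history.append(metric)
    return history


if __name__ == "__main__":
    # A queue of 100 items steered toward a target size of 10.
    for size in simulate(metric=100.0, target=10.0, steps=5):
        print(round(size, 2))
```

With a gain between 0 and 1 the error shrinks by a constant factor each step. In a real system the metric responds with delay and noise, so the gain usually needs tuning.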
sailthru/relay.mesos: A mesos plugin for Relay that lets you auto-scale the number of currently running instances of a bash command
In short, Relay.Mesos runs Relay as a Mesos framework. By combining both of these tools, we can solve control loop problems that arise in distributed systems. An example problem Relay.Mesos might solve is to spin up queue consumers to maintain or minimize a queue size. You could also use Relay.Mesos to set a target CPU usage over time for all instances of a particular task running on your Mesos cluster.
autoscale  framework  mesos  distributed  python  relay 
february 2016 by rdark
DRb as a server for long-running web processes
Using Distributed Ruby for running + monitoring of long-running processes.
ruby  sinatra  threading  drb  distributed 
april 2015 by rdark
SkyDNS is a distributed service for announcement and discovery of services built on top of etcd. It utilizes DNS queries to discover available services. This is done by leveraging SRV records in DNS, with special meaning given to subdomains, priorities and weights.
configuration  dns  distributed  docker  etcd  SOI 
january 2015 by rdark
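The SkyDNS entry above mentions SRV priorities and weights. The sketch below shows how a client might choose one endpoint from a set of SRV answers: lowest priority first, then a weighted random pick. It is a simplification of the selection rules in RFC 2782, not SkyDNS code. `SrvRecord` and `pick_srv` are hypothetical names, the hostnames are made up, and the records are assumed to have already been fetched by a DNS resolver.

```python
import random
from typing import NamedTuple, Sequence


class SrvRecord(NamedTuple):
    """Fields of a DNS SRV answer (hypothetical container for illustration)."""
    priority: int
    weight: int
    port: int
    target: str


def pick_srv(records: Sequence[SrvRecord], rng=random) -> SrvRecord:
    """Pick one record: lowest priority wins, ties are broken by weight.

    Simplified relative to RFC 2782: a zero-weight record is only chosen
    when every candidate at that priority has weight zero.
    """
    if not records:
        raise ValueError("no SRV records to choose from")
    lowest = min(r.priority for r in records)
    candidates = [r for r in records if r.priority == lowest]
    weights = [r.weight for r in candidates]
    if sum(weights) == 0:
        return rng.choice(candidates)
    return rng.choices(candidates, weights=weights, k=1)[0]


if __name__ == "__main__":
    answers = [
        SrvRecord(10, 60, 8080, "web1.example.local."),
        SrvRecord(10, 40, 8080, "web2.example.local."),
        SrvRecord(20, 100, 8080, "backup.example.local."),
    ]
    chosen = pick_srv(answers)
    print(f"{chosen.target}:{chosen.port}")
```

The backup host at priority 20 is never selected while any priority-10 record is present. A fuller client would drop unreachable targets and retry before falling back to the next priority level.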
Consul makes it simple for services to register themselves and to discover other services via a DNS or HTTP interface. Register external services such as SaaS providers as well.
configuration  monitoring  distributed  discovery  sysadmin  devops 
may 2014 by rdark
Titan is a distributed graph database optimized for storing and processing large-scale graphs within a multi-machine cluster. The primary features of Titan are itemized below.
databases  distributed  cassandra  hbase  graphing 
june 2012 by rdark
Twitter Engineering: Distributed Systems Tracing with Zipkin
Zipkin is a distributed tracing system that we created to help us gather timing data for all the disparate services involved in managing a request to the Twitter API. As an analogy, think of it as a performance profiler, like Firebug, but tailored for a website backend instead of a browser. In short, it makes Twitter faster. Today we’re open sourcing Zipkin under the APLv2 license to share a useful piece of our infrastructure with the open source community and gather feedback.
testing  infrastructure  distributed 
june 2012 by rdark
