tobym : database   90

Tarantool Real-time data integration platform for the Enterprise
Tarantool is a Lua application server integrated with a database management system. BSD-licensed, very efficient. Used at high scale.
database  app  scalability  optimization  webapp  framework  server 
20 days ago by tobym
mit-pdos/noria: Dynamically changing, partially-stateful data-flow for web application backends.
Very roughly comparable to Flexviews, PipelineDB, KSQL. Code from the Noria paper in OSDI'18.
streaming  database 
5 weeks ago by tobym
FoundationDB | Home
FoundationDB gives you the power of ACID transactions in a distributed database
distributed  nosql  database 
11 weeks ago by tobym
500 Lines or Less | Dagoba: an in-memory graph database
Interesting super-simple graph db for educational purposes
graph  graphdb  database  db  education 
11 weeks ago by tobym
Tile38 - Ultra Fast Geospatial Database & Geofencing Server
Neat geo database that works well with streaming. API similar to Redis. Tile38 delivers geospatial event notifications for mission-critical applications by pairing with external webhooks and event queues.

Loosely comparable to PostGIS.

There's built-in support for the most popular tools, including Kafka, SQS, MQTT, RabbitMQ, Redis.

MIT license
geo  database  opensource  geofence 
march 2019 by tobym
Home | PingCAP
Hybrid OLTP/OLAP database, MySQL compatible protocol. Is distributed, uses Raft for consensus on the OLTP side, and uses Spark for the OLAP side (with each side sharing the same underlying storage).
database  mysql  oltp  olap  distributed 
january 2019 by tobym
ClickHouse — open source distributed column-oriented DBMS
ClickHouse is an open source column-oriented database management system capable of real time generation of analytical data reports using SQL queries.

Can be used for log/event data with MergeTree engine (date field required), some similarities to BigQuery with its day-only partitioning.

Compare to Redshift, Vertica, BigQuery.
opensource  analytics  database  bigdata 
january 2019 by tobym
noria - Rust
Noria is a new streaming data-flow system designed to act as a fast storage backend for read-heavy web applications, based on this paper from OSDI'18. It acts like a database, but pre-computes and caches relational query results so that reads are blazingly fast. Noria automatically keeps cached results up-to-date as the underlying data, stored in persistent base tables, changes. Noria uses partially-stateful data-flow to reduce memory overhead, and supports dynamic, runtime data-flow and query change.

Research-level implementation, so basically proof-of-concept at this point. Not a drop-in replacement for anything but the most basic MySQL features. Interesting concepts though.
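
To make the "pre-computes and caches query results, keeps them up to date on write" idea concrete, here is a minimal sketch in Python. It is not Noria's API or implementation: the class `VoteCountView`, its methods, and the vote/article schema are all hypothetical, and it ignores partial state, persistence, and dynamic query changes.

```python
from collections import defaultdict


class VoteCountView:
    """Toy incrementally maintained view: number of votes per article.

    Hypothetical illustration only; not how Noria is implemented.
    """

    def __init__(self):
        self.base_votes = []            # base table: (user, article_id) rows
        self.counts = defaultdict(int)  # cached query result per article_id

    def insert_vote(self, user, article_id):
        # A write updates the base table and the cached result together,
        # so the cache never has to be rebuilt from scratch.
        self.base_votes.append((user, article_id))
        self.counts[article_id] += 1

    def read_count(self, article_id):
        # A read is a plain lookup; no aggregation happens here.
        return self.counts.get(article_id, 0)

    def recompute_count(self, article_id):
        # What a database without the cached view would do on every read.
        return sum(1 for _, a in self.base_votes if a == article_id)


view = VoteCountView()
view.insert_vote("alice", 1)
view.insert_vote("bob", 1)
view.insert_vote("carol", 2)
print(view.read_count(1), view.recompute_count(1))
```

The trade-off is the usual one for materialized results: writes do a little more work so that reads of the cached query stay cheap.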

db  database 
november 2018 by tobym
SpatiaLite: SpatiaLite
SpatiaLite is an open source library intended to extend the SQLite core to support fully fledged Spatial SQL capabilities.

SQLite + SpatiaLite is roughly equivalent to PostgreSQL + PostGIS.
geo  gis  sqlite  extension  database 
october 2018 by tobym
DBToaster - Welcome to
Engine for continuous analytical queries. Sounds similar to PipelineDB, Noria, maybe KSQL.

DBToaster is an SQL-to-native-code compiler. It generates lightweight, specialized, embeddable query engines for applications that require real-time, low-latency data processing and monitoring capabilities. The DBToaster compiler generates code that can be easily incorporated into any C++ or JVM-based (Java, Scala, ...) project.

Since 2009, DBToaster has spearheaded the currently ongoing database compilers revolution. If you are looking for the fastest possible execution of continuous analytical queries, DBToaster is the answer. DBToaster code is 3-6 orders of magnitude faster than all other systems known to us.
database  olap  analysis  analytics 
october 2018 by tobym
Materialized views with PostgreSQL for beginners – JobTeaser Tech – Medium
Use a unique index on the materialized view; then you can refresh it concurrently (REFRESH MATERIALIZED VIEW CONCURRENTLY).
june 2018 by tobym
Modern database interface for Vim
vim  plugin  db  database 
march 2018 by tobym
Scuttlebot peer-to-peer log store
Scuttlebot is an open source peer-to-peer log store used as a database, identity provider, and messaging system. It features global replication, file-synchronization, and end-to-end encryption.

Each user feed is its own blockchain, to ensure state converges.
distributed  decentralized  p2p  messaging  blockchain  database  sneakernet 
november 2017 by tobym
Wikidata is a free and open knowledge base that can be read and edited by both humans and machines.

Wikidata acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wikisource, and others.

Wikidata also provides support to many other sites and services beyond just Wikimedia projects! The content of Wikidata is available under a free license, exported using standard formats, and can be interlinked to other open data sets on the linked data web.
data  opendata  wiki  graph  database 
august 2017 by tobym
Mnesia And The Art of Remembering | Learn You Some Erlang for Great Good!
A layer on top of Erlang's ETS/DETS (key-value stores in the Erlang runtime: Erlang Term Storage, and Disk-based Erlang Term Storage). Mnesia adds transactions, queries, and distribution.
erlang  database 
july 2017 by tobym
Dgraph: Fastest graph database in the market
Meet Dgraph — an open source, scalable, distributed, highly available and fast graph database, designed from ground up to be run in production.

Still an alpha product as of Jul 5, 2017. They wrote their own storage backend, "Badger", which is basically like RocksDB (the original storage engine).
graphdb  graph  database 
july 2017 by tobym
JanusGraph: Distributed graph database
JanusGraph is a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster. JanusGraph is a transactional database that can support thousands of concurrent users executing complex graph traversals in real time.

Based on the original Titan graph db codebase, and supported by the Linux Foundation.
graphdb  distributed  database 
july 2017 by tobym
MapDB - MapDB
MapDB provides Java Maps, Sets, Lists, Queues and other collections backed by off-heap or on-disk storage. It is a hybrid between java collection framework and embedded database engine. It is free and open-source under Apache license.
java  database  library 
june 2017 by tobym
Symas Lightning Memory-mapped Database | Symas Corporation
Compare with BDB, LevelDB, RocksDB, Kyoto TreeDB. Apparently much faster than bloated RocksDB.
june 2017 by tobym
AgensGraph · Online Transactional Multi-model Graph Database
Graph db built on Postgres; supports SQL and openCypher languages.
graph  database  postgres  graphdb 
june 2017 by tobym
ORM - Skinny Framework
Scala ORM modeled on Ruby's ActiveRecord.
scala  orm  database  framework 
may 2017 by tobym
BigchainDB • The scalable blockchain database.
The scalable blockchain database. "Ethereum as logic, IPFS as disk and BigchainDB as database"
blockchain  db  database  distributed 
may 2017 by tobym
pglogical | 2ndQuadrant
pglogical is the next generation replication system for PostgreSQL

pglogical is a logical replication system implemented entirely as a PostgreSQL extension. Fully integrated, it requires no triggers or external programs. This alternative to physical replication is a highly efficient method of replicating data using a publish/subscribe model for selective replication.

- migrate and upgrade PostgreSQL with almost zero downtime
- accumulate changes from sharded database servers into a data warehouse
- scale out: copy all or a selection of database tables to other nodes in a cluster
- integrate: feed database changes in real-time to other systems
postgresql  database  replication 
april 2017 by tobym
Postgres-XL | 2ndQuadrant
Postgres-XL is a massively parallel database built on top of, and very closely compatible with PostgreSQL 9.5. It is different because it supports both Business Intelligence workloads and high-volume transactional write and read workloads all on the same platform. 

Postgres-XL is designed to be horizontally scalable and flexible enough to handle various workloads including:
- OLTP write-intensive workloads
- Business Intelligence requiring OLAP with MPP parallelism
- Operational data store
- Key-value store including JSON
- GIS Geospatial
- Mixed-workload environments
postgresql  distributed  database 
april 2017 by tobym
Postgres-BDR | 2ndQuadrant
Bi-Directional Replication for PostgreSQL (Postgres-BDR, or BDR) is the first open source multi-master replication system for PostgreSQL to reach full production status, developed by 2ndQuadrant and assisted by an active user community. BDR is specifically designed for use in geographically distributed clusters, using highly efficient asynchronous logical replication, supporting anything from 2 to more than 48 nodes in a distributed database.
postgresql  distributed  database  ha 
april 2017 by tobym
Lightning Memory-Mapped Database - Wikipedia
Key/value database library with transactions and concurrency support.
Compare with RocksDB.
nosql  database  key-value 
march 2017 by tobym
RocksDB | A persistent key-value store | RocksDB

Extension of Google's LevelDB, adding geospatial features and more optimization and speed.
db  key-value  database  nosql 
march 2017 by tobym
ArangoDB - highly available multi-model NoSQL database
Production ready highly available Multi-Model NoSQL database
nosql  graphdb  graph  database 
february 2017 by tobym
Another metrics database. Compare with InfluxDB, or the Cassandra backend for Graphite. Or to the ancestors, whisper and rrd.
metrics  analytics  database  riak 
february 2017 by tobym
Just write SQL and get things done!
scala  sql  jdbc  database  library 
december 2016 by tobym
KeRF - Kerf software
impressive marketing on the page; closed source though and targeted towards financial institutions
timeseries  analytics  database 
april 2016 by tobym
Prof. Jens Dittrich
Collection of information, including videos, about database systems. E.g. data layouts, indexes, query processing algorithms, query planning and optimization, recovery, and big data/mapreduce/hadoop.
july 2015 by tobym
Sqitch by theory
DB migration manager, inspired by git.
database  db  sql 
may 2015 by tobym
Aerospike at Tapad
My latest upload: Aerospike at Tapad
adtech  database 
august 2014 by tobym
Open-source graph database written in Go. Supports Gremlin-like and Freebase's MQL-like query languages, and has multiple backend stores such as LevelDB or Mongo.
graph  database  golang 
june 2014 by tobym
Presto | Distributed SQL Query Engine for Big Data
Distributed SQL Query Engine for Big Data. Facebook uses this.
bigdata  database  sql 
november 2013 by tobym
Shark - Lightning Fast Data Warehouse System
Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer Hive QL queries up to 30 times faster than Hive without modification to the existing data or queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions.
shark  spark  hive  hadoop  database  datawarehouse  analytics 
october 2012 by tobym
Works by executing SQL queries over encrypted data using a collection of efficient SQL-aware encryption schemes. DB admin doesn't have access to unencrypted data.
encryption  mysql  database 
december 2011 by tobym
leveldb - a fast and lightweight key/value database library - Google Project Hosting
LevelDB is a library that implements a fast persistent key-value store.
c++  opensource  database  library 
may 2011 by tobym
A Graph Processing Stack
Rexster gives Blueprints-enabled graphs a REST interface automatically. What does this mean for neo4j-rest?
graph  database 
december 2010 by tobym
CMPH - C Minimal Perfect Hashing Library
good for hashing to known sets on the order of a billion keys. Nokia's DiscoDB uses this. Contrast with normal B+ tree indexes which are best when there are frequent inserts and deletes.
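
For a feel of what "minimal perfect hashing to a known set" means, here is a toy Python sketch. It brute-forces a seed until every key lands in its own slot in `0..n-1`; this is not one of the algorithms CMPH implements and only works for tiny key sets. The function names `build_minimal_perfect_hash` and `slot_for`, and the sample keys, are hypothetical.

```python
import hashlib


def slot_for(key, seed, n):
    """Map a string key to a slot in range(n) using a seeded hash."""
    digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
    return int.from_bytes(digest[:8], "big") % n


def build_minimal_perfect_hash(keys, max_seed=1_000_000):
    """Search for a seed giving a collision-free mapping onto range(len(keys)).

    Brute force: fine for a handful of keys, hopeless for a billion.
    """
    keys = list(keys)
    n = len(keys)
    for seed in range(max_seed):
        if len({slot_for(k, seed, n) for k in keys}) == n:
            return seed
    raise ValueError("no suitable seed found")


keys = ["apple", "banana", "cherry", "date", "elderberry"]
seed = build_minimal_perfect_hash(keys)
table = [None] * len(keys)
for k in keys:
    table[slot_for(k, seed, len(keys))] = k  # every slot is filled exactly once
print(table)
```

Because the key set is fixed, the table has no empty slots and no collision handling, which is the property that makes this kind of index attractive for read-only data and unsuitable when keys are inserted or deleted often.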
c  hash  opensource  datastructures  database 
october 2010 by tobym
cdb - "constant database"
Fast, simple package for creating and reading constant databases. 4 GB limit, but no requirement to fit in memory. Used in qmail.
cdb  database  datastore  c 
october 2010 by tobym
List of nosql databases
Proving that there are too many of these
nosql  database  list 
may 2010 by tobym