Fauna | Serializability vs “Strict” Serializability: The Dirty Secret of Database Isolation Levels
For many years “serializability” was referred to as the “gold standard” of database isolation levels. It was the highest isolation level offered in the vast majority of commercial database systems (some very widely deployed systems could not even offer an isolation level as high as serializability). In this post, we show that serializability is not and never was a “gold standard” for database systems. In fact, strict serializability has always been the gold standard. As we will see,...
concurrency  ACID  database  faunadb  serialization 
yesterday
Faster Retrieval of Top Hits in Elasticsearch with Block-Max WAND | Elastic
Block-max WAND is coming to Lucene 8.0 and Elasticsearch 7.0, and it's bringing serious performance boosts. Learn about this big change and how it impacts you.
elasticsearch  elasticsearch7  top-k 
3 days ago
Easier Relevance Tuning in Elasticsearch 7.0 | Elastic
Elasticsearch 7.0 brings some new tools to make relevance tuning easier. Learn about the new rank_feature and rank_features fields, and Script Score Queries.
elasticsearch  relevance  elasticsearch7 
3 days ago
GoogleCloudPlatform/microservices-demo: Sample cloud-native application with 10 microservices showcasing Kubernetes, Istio, gRPC and OpenCensus.
Sample cloud-native application with 10 microservices showcasing Kubernetes, Istio, gRPC and OpenCensus. - GoogleCloudPlatform/microservices-demo
docker  microservices  kubernetes  istio  gcp  aws  boilerplate  cloud  cloudnative  googlecloud  github  grpc 
8 days ago
PostgreSQL 11 Reestablishes Window Functions Leadership
PostgreSQL 11 offers the best OVER clause support among its competitors, true procedures, and improvements in FETCH FIRST and XML.
postgresql  postgres  databases  postgressql-11  whatsnew 
8 days ago
sveinbjornt/Sloth: Mac app that shows all open files and sockets in use by all running processes. Nice GUI for lsof.
Mac app that shows all open files and sockets in use by all running processes. Nice GUI for lsof. - sveinbjornt/Sloth
github  mac  macos  lsof  tools  security  osx  software  sysadmin  app  filesystem 
8 days ago
eranyanay/1m-go-websockets: handling 1M websockets connections in Go
handling 1M websockets connections in Go. Contribute to eranyanay/1m-go-websockets development by creating an account on GitHub.
networking  golang  websockets  scalability  github  optimization  tips 
8 days ago
fdv/running-elasticsearch-fun-profit: A book about running Elasticsearch
A book about running Elasticsearch. Contribute to fdv/running-elasticsearch-fun-profit development by creating an account on GitHub.
ebook  elasticsearch  scaling  github  operation  ops  jvm  tuning  sysadmin  documentation 
10 days ago
How Three Fundamental Data Structures Impact Storage and Retrieval - DZone Database
CTO of Percona, Vadim Tkachenko, explains the difference between B-Trees, LSM Trees, and Fractal Trees, complete with examples and performance analysis of each method.
database  algorithms  datastructure  btree  LSM  lsm-tree  MySQL  performance  fractal  fractal-tree-index 
16 days ago
Welcome — Statistics Done Wrong
If you’re a practicing scientist, you probably use statistics to analyze your data. From basic t tests and standard error calculations to Cox proportional hazards models and propensity score matching, we rely on statistics to give answers to scientific problems.

This is unfortunate, because statistical errors are rife.

Statistics Done Wrong is a guide to the most popular statistical errors and slip-ups committed by scientists every day, in the lab and in peer-reviewed journals. Man...
book  ebooks  math  statistics  stats  bestpractices  DataAnalysis  datascience 
24 days ago
ProseMirror
A toolkit for building rich-text editors on the web
wysiwyg  editor  markdown  javascript  webdev  collaboration  text  text-editor 
28 days ago
Google Search Operators: The Complete List (42 Advanced Operators)
Do you want to know how to leverage Google for your SEO needs? Check out this list of 42 Google advanced search operators, plus 15 actionable ways to use them to accomplish specific SEO tasks.
query  google  seo  search  operators  reference  advanced  boolean 
29 days ago
yzhao062/pyod: A Python Toolkit for Scalable Outlier Detection (Anomaly Detection)
PyOD is a comprehensive and scalable Python toolkit for detecting outlying objects in multivariate data. This exciting yet challenging field is commonly referred to as Outlier Detection or Anomaly Detection. Since 2017, PyOD has been successfully used in various academic research projects and commercial products. PyOD is featured for:

Unified APIs, detailed documentation, and interactive examples across various algorithms.
Advanced models, including Neural Networks/Deep Learning and Outlier Ensem...
anomaly  anomaly-detection  machinelearning  python  deeplearning  neuralnetwork  github 
5 weeks ago
MentatInnovations/datastream.io: An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana - MentatInnovations/datastream.io
anomaly-detection  anomaly  python  elasticsearch  kibana  stream  timeseries  github 
5 weeks ago
linkedin/luminol: Anomaly Detection and Correlation library
Luminol is a lightweight Python library for time series data analysis. The two major functionalities it supports are anomaly detection and correlation. It can be used to investigate possible causes of an anomaly. You collect time series data and Luminol can:

Given a time series, detect whether the data contains any anomaly and give you back the time window in which the anomaly happened, the timestamp at which the anomaly reaches its severity, and a score indicating how severe the anomaly is compared to ...
anomaly  anomaly-detection  linkedin  python  timeseries  library 
5 weeks ago
webpro/awesome-dotfiles: A curated list of dotfiles resources.
A curated list of dotfiles resources. Contribute to webpro/awesome-dotfiles development by creating an account on GitHub.
github  commands  dotfiles  bash  curated  list  cli  linux  unix  howto  shell  awesome 
6 weeks ago
ruiqimao/keyboard-pcb-guide: Guide on how to design keyboard PCBs with KiCad
Guide on how to design keyboard PCBs with KiCad. Contribute to ruiqimao/keyboard-pcb-guide development by creating an account on GitHub.
kicad  github  pcb  keyboard  hardware  design  guide  diy  electronics  CAD 
8 weeks ago
Twitter
Recently Samsung brought Supreme on stage to announce a partnership and a new flagship store in China

The only pro…
10 weeks ago
Learning to rank - Wikipedia
Learning to rank[1] or machine-learned ranking (MLR) is the application of machine learning, typically supervised, semi-supervised or reinforcement learning, in the construction of ranking models for information retrieval systems.[2] Training data consists of lists of items with some partial order specified between items in each list. This order is typically induced by giving a numerical or ordinal score or a binary judgment (e.g. "relevant" or "not relevant") for each item. The ranking model's purpose is to rank, i.e. produce a permutation of items in new, unseen lists in a way which is "similar" to rankings in the training data in some sense.
machinelearning  ranking  wiki  wikipedia 
11 weeks ago
Falcon - A modern high-performance web server for Ruby, supporting HTTP/2 and HTTPS out of the box.
Falcon is a multi-process, multi-fiber rack-compatible HTTP server built on top of async, async-io, async-container and async-http. Each request is run within a lightweight fiber and can block on up-stream requests without stalling the entire server process. Supports HTTP/1 and HTTP/2 natively.
ruby  http  http2  http-server  server  gem  github  library 
october 2018
Awesome asynchronous I/O for Ruby.
Several years ago, I was hosting websites on a server in my garage. Back then, my ADSL modem was very basic, and I wanted to have a DNS server which would resolve to an internal IP address when the domain itself resolved to my public IP. Thus was born RubyDNS. This project was originally built on top of EventMachine, but a lack of support for IPv6 at the time and other problems meant that I started looking for other options. Around that time Celluloid was picking up steam. I had not encountered actors before and I wanted to learn more about them. So, I reimplemented RubyDNS on top of Celluloid and this eventually became the first stable release.

Moving forward, I refactored the internals of RubyDNS into Celluloid::DNS. This rewrite helped solidify the design of RubyDNS and to a certain extent it works. However, unfixed bugs and design problems in Celluloid meant that RubyDNS 2.0 was delayed by almost 2 years. I wasn't happy releasing it with known bugs and problems. After sitting on the problem for a while, and thinking about possible solutions, I decided to build a small event reactor using nio4r and timers, the core parts of Celluloid::IO which made it work so well. The result is this project.

In addition, there is a similarly designed C++ library of the same name. These two libraries share similar design principles, but are different in some areas due to the underlying semantic differences of the languages.
ruby  async  async-io  io  gem  github  library  c++ 
october 2018
Twitter
The iPhone XS is faster than an iMac Pro on the Speedometer 2.0 JavaScript benchmark. It's the fastest device I've…
september 2018
Auto-squashing Git Commits
git commit --fixup
git commit --squash
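A minimal sketch of how these two flags are typically used, assuming a throwaway repository: `git commit --fixup=<commit>` records a commit whose subject starts with `fixup!`, and a later `git rebase -i --autosquash` moves it next to its target and folds it in. The file names, commit messages and identity below are hypothetical; `GIT_SEQUENCE_EDITOR=true` simply accepts the generated todo list without opening an editor.

```shell
set -eu

# Throwaway repository with a local identity (hypothetical values).
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example"
git config user.email "example@example.com"

# Two ordinary commits.
echo "one" > a.txt
git add a.txt
git commit -q -m "Add a.txt"
echo "two" > b.txt
git add b.txt
git commit -q -m "Add b.txt"

# A correction that belongs in the first commit.
echo "one fixed" > a.txt
git add a.txt
target=$(git rev-parse HEAD~1)
git commit -q --fixup="$target"   # subject becomes "fixup! Add a.txt"

# Fold the fixup into its target; --root lets the rebase reach the first commit.
GIT_SEQUENCE_EDITOR=true git rebase -q -i --autosquash --root

git log --oneline
```

After the rebase the history should contain only the two original commits, with the correction absorbed into the first. `git commit --squash=<commit>` works the same way, except that the commit is marked `squash!` and its message is offered for merging with the target's message during the rebase.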
git  rebase  convention  fixup  squash 
september 2018
GopherCon 2018 - Allocator Wrestling
A whirlwind tour of the Go memory allocator and garbage collector, with tools and tips on how to optimize.

* The allocator and garbage collector are pretty ingenious!
* Single allocations are fast but not free
* The garbage collector can stop individual goroutines, even though STW pauses are very short
* A combination of tools is essential to understand what's going on:
  * benchmark with GC off
  * use CPU profiler to find hot allocations
  * use memory profiler to understand allocation count/bytes
  * use execution tracer to understand GC pattern
  * use escape analyzer to understand why allocations happen
garbagecollection  golang  architecture  gc  overview 
september 2018
Hammerspoon
This is a tool for powerful automation of OS X. At its core, Hammerspoon is just a bridge between the operating system and a Lua scripting engine. What gives Hammerspoon its power is a set of extensions that expose specific pieces of system functionality to the user.
automation  mac  osx  lua  scripting 
august 2018
vmxdev/tkvdb: Trie key-value database
Trie (radix trie in fact) key-value database

tkvdb is an embedded database library for key-value data. It is similar to Berkeley DB, LevelDB or SQLite4 LSM.

Keys are always sorted in memcmp() order.
database  key-value-store  trie  data-structures  radix-tree  c  programming  github 
august 2018
Twitter
The next chapter: after a 1.5 year "sabbatical" at , I'm thrilled to announce that I'm rejoining in…
august 2018
Twitter
One of the things that surprised me most teaching this week was how useful it was to share my contact sheets - fill…
july 2018
Twitter
The rain finally let up and a family of red foxes went hunting for food. D850 & 600mm + X2 @ 1/640, f/8,…
july 2018
diasks2/pragmatic_segmenter: Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
nlp  sentence  ruby  segmented  rule-based  github  library 
july 2018