recentpopularlog in

aries1988 : data   127

« earlier  
Homo sapiens devient-il homo informaticus ?
oisif, oisive

adjectif et nom
(ancien français oidif, avec l'influence de oiseux)

Qui n'exerce aucune activité permanente et dispose de nombreux loisirs, qui vit sans travailler.
human  workforce  future  book  français  opinion  work  self  ai  data 
september 2018 by aries1988
chenqin - 知乎
有问题,上知乎。知乎是中文互联网知名知识分享平台,以「知识连接一切」为愿景,致力于构建一个人人都可以便捷接入的知识分享网络,让人们便捷地与世界分享知识、经验和见解,发现更大的世界。
data 
september 2018 by aries1988
长租公寓拉涨了北京房租,上海会是下一个吗?
作者:chenqin 自如、蛋壳等长租公寓,在北京的房租上升里扮演了一个什么角色?大部分观点认为,他们是房租上涨的罪魁祸首;也有一小部分观点认为,房租上涨只是市场行为。本文针对这个问题进行一些简单研究。 首先感谢 @城市数据团…
beijing  shanghai  appartement  comparison  data 
september 2018 by aries1988
About the GSL
The General Service List (GSL) (West. 1953) is a set of 2,000 words selected to be of the greatest "general service" to learners of English. They are not the most common 2,000 words, though frequency was one of the factors taken into account in making the selection. Each of the 2,000 words is a headword representing a word family that is only loosely defined in West. Frequency numbers are given, derived from Thorndike and Lorge (1944). Frequency data is also given for the various meanings of words. This list has had a wide influence for many years, serving as the basis for graded readers as well as other material. Texts based on the GSL are still on sale, but the list itself is out of print. A fuller discussion of the GSL, and word lists in general, can be found in Nation (1990, pp 21-24) and Carter and McCarthy (1988, Ch. 1)
english  vocabulary  list  data 
december 2017 by aries1988
Hans Rosling, physician and statistician, 1948-2017

Rosling, who has died aged 68, was the closest thing statisticians had to a rock star. His most famous talk, 2006’s The Best Stats You’ve Ever Seen, has been watched online more than 11m times. Its ambitious scope and sweeping narrative epitomised Rosling’s ability to rise above the ebb and flow of current affairs and see generational trends.

It led him to conclude that on most measures of human progress — the impact of climate change being a notable exception — most countries were improving rapidly.

Public perceptions had not kept up with the pace of economic and demographic development and much of the media was hobbled by its adherence to false balance, he said. The world is discussed in terms of feelings and ideologies rather than as an area of knowledge, he once told the Financial Times.
leader  statistics  communication  science  world  data  debate  politics  ideology  optimism  explained  population 
december 2017 by aries1988
Be careful with social media — employers are watching
The role social media plays in HR matters raises even bigger questions. Facebook and Twitter have been open for 11 years. That means a whole generation, now graduating into the workplace, has its entire history online forever, as preserved as an insect in amber.

The digital realm covers millennials’ exact geographical locations, their views, relationship history, consuming preferences, their social and work life. I am told that a determined stalker could discover how you vote, an approximation of your medical history, your financial and legal trails and much more.
social-network  data  hr  career 
october 2017 by aries1988
A Big Brother approach has qualities that would benefit society
Based on the “citizen score”, the Chinese state will be able to improve — or restrict — such privileges as high broadband speed, foreign travel visas, social benefits, access to elite restaurants, favourable insurance premiums and the quality of schooling offered to a person’s children.

Social media posts praising the government, Communist party and the economy, Ms Botsman says, will enhance your rating.

Yet, I am almost embarrassed to say, I get it. Bearing in mind China’s violent history, I understand its preoccupation with order and harmony, and how the omniscient potential of computing, the internet and mobile data has been irresistible to the political intelligentsia.

We have indeed all the elements that make China’s Social Credit system possible. Pull together the credit record of an individual or business as well as their social media posts, browser history, tax record, criminal record, fitness statistics, supermarket loyalty card details, Yelp, eBay, TripAdvisor, feedback and you have the same thing.
bigdata  data  government  privacy  society  utopia 
october 2017 by aries1988
錢鋼語象報告:十九大之後,什麼是「新時代」的關鍵詞?|端傳媒 Initium Media
在黨權、鬥爭等方面,十九大報告的「毛式話語」力度不降反升,或許這正是「新時代中國特色」:集一黨之權、舉國之力,統一意志和步伐,壓制異見,清除路障,高效建設美麗新世界。
data 
october 2017 by aries1988
Branded in Memory
The bite, which is the logo's most iconic feature, was included for scale, so the apple wouldn't be mistaken for a cherry.

Starbucks' current logo, introduced in 2011, is a streamlined version of the two-tailed siren. It no longer features the "Starbucks Coffee" text and is pure green, as opposed to green and black. Despite this simplification, only 6 percent of people drew a near perfect Starbucks logo from memory.

The question at the heart of this experiment is "How accurately can we recall logos we see on a daily basis?" The results show that most people are very good at recalling brand colors – around 80 percent selected the correct palettes for their drawings, while shapes and elements in logos are harder to recall.

There was, however, a difference by age. On average, younger people drew more accurate logos than older people. This was true across almost all brands, but was most noticeable for 7-Eleven, Burger King, and Adidas. Walmart was least affected by age, showing no difference between younger and older groups.

Confidence scaled with accuracy, so while people in general overestimated how well they did, those who did best had the best awareness of their ability.
logo  design  advertising  fun  comparison  data  memory  poll  infographics  analysis 
october 2017 by aries1988
Roughness Map - WAsP - Global Mapper Forum
There isn't a special solution for this yet (we were kind of hoping WaSP would make their handling of roughness areas easier to use to avoid a messy workaround, but I don't know if this will happen). The best workaround for now is to create the buffers with a small negative distance to shrink the roughness areas so they don't touch each other since this is the problem. Really with roughness areas the ROUGH_L and ROUGH_R don't make sense as you really just need a roughness value inside the area and other areas can cover the outside areas, but Wasp doesn't quite work that way. Because of this the ROUGH_R value will be the roughness inside the area (for clockwise areas) and you can then edit the ROUGH_L to be an invalid roughness for the very tiny stripe between areas after shrinking them.
explained  tool  terrain  data  md  ofm 
october 2017 by aries1988
Sig WindE/Validation Cases - OpenFOAMWiki
The aim of this page is to hold validation cases for ABL (Atmospheric Boundary Layer) flows that are of interest to anybody starting to model ABL flows in OF. The simulations I perform, and refer to mainly in this text are RANS with k-epsilon turbulence closure models, using simpleFoam. Most of the first simulations that are placed in this page were done according to the excellent Martinez 2011 Ms.C. thesis [1] All of the boundary conditions and mesh creation suggestions were followed by from his work and the references within it.
validation  example  tool  data  ofm 
october 2017 by aries1988
Shuttle Radar Topography Mission - Wikipedia
The Shuttle Radar Topography Mission (SRTM) is an international research effort that obtained digital elevation models on a near-global scale from 56° S to 60° N,[2] to generate the most complete high-resolution digital topographic database of Earth prior to the release of the ASTER GDEM in 2009. SRTM consisted of a specially modified radar system that flew on board the Space Shuttle Endeavour during the 11-day STS-99 mission in February 2000, based on the older Spaceborne Imaging Radar-C/X-band Synthetic Aperture Radar (SIR-C/X-SAR), previously used on the Shuttle in 1994. To acquire topographic data, the SRTM payload was outfitted with two radar antennas.[2] One antenna was located in the Shuttle's payload bay, the other – a critical change from the SIR-C/X-SAR, allowing single-pass interferometry – on the end of a 60-meter (200-foot) mast that extended from the payload bay once the Shuttle was in space.[2] The technique employed is known as interferometric synthetic aperture radar.
radar  data  earth  terrain  md 
september 2017 by aries1988
Where to download free elevation data for Europe? - Geographic Information Systems Stack Exchange
The 3-arc second (~90 meter) SRTM covers all of Europe to 60° N, has good positional accuracy and is part of the public domain
data  explained  geography  md 
september 2017 by aries1988
New to the Archaeologist’s Tool Kit: The Drone

Dr. Castillo’s eureka moment occurred in 2012, while teaching in Sweden, where researchers were working with a powerful Russian-made computer program that could meld hundreds of photographs into a 3-D composite image. Dr. Castillo realized that by feeding his drone photographs into the program, he could produce incredibly detailed and clear 3-D images of ancient temples, fortifications and burial sites.
data  drone  archaeology  technology  today  map 
august 2017 by aries1988
The ‘time machine’ reconstructing ancient Venice’s social networks
Venice is the perfect city for the experiment because of its wealth of historically important, well-ordered documentation. It was founded in the fifth century AD by citizens of the Roman empire escaping barbarian invaders from the north. Its inhospitable lagoons provided much-needed protection, and its location at the north end of the Adriatic Sea also had strategic advantages. It soon became the most important trading post between Western Europe and the east, bringing it riches and power.
history  archive  data  infographics  project  archaeology  commerce  research  society  social-network 
june 2017 by aries1988
Your Search History Shows Your Secret Side
What our Google searches reveal about us. It’s not always the face we show to the world.
podcast  data  google  howto 
may 2017 by aries1988
How to Become a Data Scientist - On your own

Here is my cheat sheet of becoming a Data Scientist through your own efforts
list  howto  job  data  online  mooc  resource  programming  tutorial  moi  datascience 
may 2017 by aries1988
Shaking Up the Dinosaur Family Tree

After analyzing 32 billion trees, the computer spat out the best possible arrangement of Mr. Baron’s three years’ worth of data collection. The run took just five minutes.
taxonomy  dinosaur  data  theory  discovery  biology 
march 2017 by aries1988
Is your commute worth the pain? The real cost of getting to work
https://www.instapaper.com/read/885035373

Xinzhuang

When suburbs sprawl out too far and fail to stay compact, he says, then “even if [the authorities] wanted to provide good public transport services, it will end up more and more costly.”

Urawa
city  life  moi  analysis  data  shanghai  tokyo  work  comparison  world  time  immobilier  price  commute 
march 2017 by aries1988
How statistics lost their power – and why we should fear what comes next | William Davies

In France, it has been illegal to collect census data on ethnicity since 1978, on the basis that such data could be used for racist political purposes. (This has the side-effect of making systemic racism in the labour market much harder to quantify.)

Speaking scientifically about the nation – for instance in terms of macroeconomics – is an insult to those who would prefer to rely on memory and narrative for their sense of nationhood, and are sick of being told that their imagined community does not exist.

the geography of capitalism has been pulling in somewhat different directions. Plainly globalisation has not rendered geography irrelevant. In many cases it has made the location of economic activity far more important, exacerbating the inequality between successful locations (such as London or San Francisco) and less successful locations (such as north-east England or the US rust belt). The key geographic units involved are no longer nation states. Rather, it is cities, regions or individual urban neighbourhoods that are rising and falling.

Immigration may be good for the economy overall, but this does not mean that there are no local costs at all. So when politicians use national indicators to make their case, they implicitly assume some spirit of patriotic mutual sacrifice on the part of voters: you might be the loser on this occasion, but next time you might be the beneficiary.

Why then do the events of the past year feel quite so damaging to the ideal of quantitative expertise and its role in political debate?

Statistics, collected and compiled by technical experts, are giving way to data that accumulates by default, as a consequence of sweeping digitisation. Traditionally, statisticians have known which questions they wanted to ask regarding which population, then set out to answer them.

In this new world, data is captured first and research questions come later.

First, there is no fixed scale of analysis (such as the nation) nor any settled categories (such as unemployed).

Second, the majority of us are entirely oblivious to what all this data says about us, either individually or collectively.

What is most politically significant about this shift from a logic of statistics to one of data is how comfortably it sits with the rise of populism.

These data analysts are often physicists or mathematicians, whose skills are not developed for the study of society at all.

During the presidential election campaign, Cambridge Analytica drew on various data sources to develop psychological profiles of millions of Americans, which it then used to help Trump target voters with tailored messaging.

The new apparatus of number-crunching is well suited to detecting trends, sensing the mood and spotting things as they bubble up. It serves campaign managers and marketers very well. It is less well suited to making the kinds of unambiguous, objective, potentially consensus-forming claims about society that statisticians and economists are paid for.
statistics  expert  data  crisis  opinion  conflict 
january 2017 by aries1988
我们究竟需要多大的词汇量?
从统计图中可以看出,《经济学人》的36万多词文本中,只有 92.57% 的单词落在BNC词表中前8000词区间中。也就是说,假如你的词汇量是8000,那么你只能读懂《经济学人》文本的92.57%,按照上面的词汇覆盖率标准,这样的覆盖率并不能保证顺利理解文本。如果你的词汇量达到了10000,那么你大概可以读懂 95.05%的《经济学人》,基本可以理解文本内容。如果词汇量达到了15000,你的词汇覆盖率会上升到97.92%,基本达到了充分理解的程度。

由此看来,“掌握几千个单词就够用”这种说法在阅读中并不靠谱。即使你词汇量达到8000,读起野生外刊仍然会捉襟见肘。如果想要比较舒服地读懂外刊,那么你的词汇量最好不要低于12000。

那么,熟练掌握几千个单词对听力够用吗?答案也是否定的。认真听过BBC以及VOA的同学应该会留意到,这些听力材料的难度并不比原版外刊低多少,要听懂它们仍然需要10000以上的词汇量。
vocabulary  study  english  tool  to:things  numbers  question  howto  data  analysis 
december 2016 by aries1988
A century of earthquakes
A recent earthquake reduced large parts of central Italy to a pile of rubble and killed almost 270 people. Though shallow, the tremor was felt in buildings as far as Rome and Florence. At 6.2 magnitude, it was the second strongest in 35 years on the peninsula.
data  geology  earthquake  history 
august 2016 by aries1988
How drones are learning to find their own way in the world | New Scientist
It previously took hours of postflight processing to build maps from drone footage. The bottleneck was processing power. The team overcame this by harnessing the power of GPUs – fast image-processing chips originally developed for games.
data  drone  GPS  technology  map  mapping 
july 2016 by aries1988
Baidu uses millions of users' location data to make predictions | New Scientist
The researchers hand-labelled thousands of areas of interest – offices, shopping centres and industrial zones – across China. Then they studied location data from the end of 2014 to the middle of 2016 to see how many people were at those places at each time, and how that changed through the year.
data  baidu  study  chinese  economy  opinion 
july 2016 by aries1988
Music in the age of the algorithm — FT.com
Distinction increasingly resides in the range of our listening, its eclecticism, not a narrow attachment to genre. Yet without a map, some sense of what we like and dislike, how do we make sense of the expanse of music stretching around us?
book  consumer  habit  data  music  online  evolution  spotify 
june 2016 by aries1988
Who's downloading pirated papers? Everyone | Science | AAAS
The United States is the fifth largest downloader after Russia, and a quarter of the Sci-Hub requests for papers came from the 34 members of the Organization for Economic Cooperation and Development, the wealthiest nations with, supposedly, the best journal access. In fact, some of the most intense use of Sci-Hub appears to be happening on the campuses of U.S. and European universities.
data  numbers  piracy  academia 
may 2016 by aries1988
Why so much science research is flawed – and what to do about it
Some fields of research are less susceptible than others, though. In astronomy, chemistry and physics, for instance, people have a very strong tradition of sharing data, and of using common databases like big telescopes or high energy physical experiments, Ioannidis says. They are very cautious about making claims that eventually will be refuted. But in fields where such checks and balances are absent, irreproducible results are rife.
science  academia  data  debate  ethic  scientist 
april 2016 by aries1988
Comment faire pour créer des liens hypertexte absolus et relatifs dans des documents Word 2003 et Word 2007
Comment faire pour utiliser des liens hypertexte absolus au lieu de liens hypertexte relatifs
office  data  excel 
april 2016 by aries1988
The Dream World Of ‘The Good Dinosaur’ Is Based On USGS Surveys
“When Arlo’s in danger, you feel in danger. The environment is really the antagonist in the film.” Believe it or not, Pixar’s “The Good Dinosaur” — which cente…
geography  animation  data 
january 2016 by aries1988
The Economic Guide To Picking A College Major
There may also be a built-in risk premium: The energy industry is notorious for its boom-bust volatility, so there’s no guarantee that those newly minted petroleum engineers will still be pulling down six-figure salaries if oil prices suddenly crater.

In fact, there’s almost no correlation between how popular a major is and how lucrative it is. Psychology is far and away the most popular major despite paying a below-average median wage of $31,500. Highly paid engineering fields, meanwhile, are among the least popular fields of study.
data  ranking  university  job  choice  future  education  moi 
august 2015 by aries1988
Decades of Facebook likes will explain how you became yourself
According to a research group in the UK, it turns out that what people choose to “like” on Facebook can be used to determine with 95% accuracy whether they are Caucasian or African American, 88% accuracy whether they are gay or straight, and 65% accuracy whether they are a drug user, among other things.

We’re already tackling life history questions based on Facebook likes. For example, did your parents get divorced before they were 21, they can unlock that with 60% certitude. Given that it’s only a few years’ worth of likes, imagine that it’s in five or 10 years and there’s that much more data to go on, and people are revealing their lives through their smartphones and their laptops.”
data  prediction  behavior  Facebook 
september 2014 by aries1988
新词发现之爆笑NBA | 简书
新词发现的代码已经托管在github:new-words-discovery。目前能算出候选词的凝固度和自由度,尚需手动设置阈值进行刷选。
data  fun 
june 2014 by aries1988
Why Websites Still Can’t Predict Exactly What You Want - Kaiser Fung - Harvard Business Review
All that data and still an underwhelming result. What’s happening here? It seems that it’s a matter of how the companies position personalization. They regard it as a tool for upselling–they want to push us out of our comfort zone, to buy new things, and to buy more things. To achieve that goal, the companies can’t just look at one’s historical browsing or purchase patterns. Instead, data scientists look for traits in similar customers. When you position personalization this way, you build algorithms that are based on finding variables.

But the examples I cited above about the shoes, and the bank, and the travel, are based on invariables such as physical attributes (the size of your feet), cyclical life events (paying bills), or habits (brand loyalty). The beauty of such measures is the chances of matching user to need is much higher than the chances when predicting needs based on variant behaviors.
data  consumer  today 
june 2014 by aries1988
干货:详解大数据足以引领第四次革命的商业价值
我们知道:

第一次工业革命以煤炭为基础,蒸汽机和印刷术为标志,
第二次工业革命以石油为基础,内燃机和电信技术为标志,
第三次工业革命以核能基础,互联网技术为标志,
第四次工业革命以可再生能源为基础,_________为标志。
from:rss  data 
may 2014 by aries1988
Marseille reste la ville la plus embouteillée de France
Lyon se situe en 3e position du classement avec un taux de congestion de 31 %. «Sur un trajet de 30 minutes en conditions de circulation optimales, un conducteur perd en moyenne 39 minutes par jour lorsqu'il effectue ce trajet aller-retour aux heures de pointe du matin et du soir et comme à Paris, la situation s'est détériorée puisque ce temps a augmenté de 3 minutes par rapport à l'an dernier», commentent les auteurs de l'étude.
france  city  life  car  data  comparison 
november 2013 by aries1988
Draft: From Zero To Kickstarter In $5k | Need/Want
We decided on a hard cover, so you can throw it in your bag without worrying about it. Super thick paper means when you’re writing a note, it never bleeds through to the next page (this would also be important during scanning!). We also made them the exact size of the iPad mini, as a lot of people will probably be carrying these things together anyway.

We needed to work out how we were going to position this. We’d played around with A/B testing headlines on Facebook with Smart Bedding, but it can get pretty expensive if you want a decent-sized pool of data.

Incidentally, if you’re a product person that ever has trouble explaining flows and communicating ideas to designers and engineers, I can’t recommend spending a few hours learning your way around After Effects highly enough. Being able to mock up exactly how you want things to look and work, in a few minutes, is INVALUABLE!

There seems to be a strong correlation between the quality of a Kickstarter video and how much money that project raises. Because of this, we wanted to make sure our video was decent. This was by far our biggest expense.
people  data  money  project  startup  idea  pkm 
november 2013 by aries1988
« earlier      
per page:    204080120160

Copy this bookmark:





to read