recentpopularlog in

tsuomela : python   125

« earlier  
Analyzing Documents with TF-IDF | Programming Historian
"This lesson focuses on a foundational natural language processing and information retrieval method called Term Frequency - Inverse Document Frequency (tf-idf). This lesson explores the foundations of tf-idf, and will also introduce you to some of the questions and concepts of computationally oriented text analysis."
text-mining  python  digital-humanities  text-analysis  tutorial 
25 days ago by tsuomela
First Python Notebook — First Python Notebook 1.0 documentation
"A step-by-step guide to analyzing data with Python and the Jupyter Notebook."
data-science  python  programming  notebook 
december 2018 by tsuomela
Binder (beta)
"Have a repository full of Jupyter notebooks? With Binder, open those notebooks in an executable environment, making your code immediately reproducible by anyone, anywhere. "
data-curation  reproducible  python  ipython  programming  notebook  sharing  research  tool  github 
july 2018 by tsuomela
Topic Modeling in Python with NLTK and Gensim | DataScience+
"In this post, we will learn how to identify which topic is discussed in a document, called topic modeling. In particular, we will cover Latent Dirichlet Allocation (LDA): a widely used topic modelling technique. And we will apply LDA to convert set of research papers to a set of topics."
python  statistics  topic-modeling  digital-humanities  methods 
april 2018 by tsuomela
Python Programming for the Humanities by Folgert Karsdorp
"The programming language Python is widely used within many scientific domains nowadays and the language is readily accessible to scholars from the Humanities. Python is an excellent choice for dealing with (linguistic as well as literary) textual data, which is so typical of the Humanities. In this book you will be thoroughly introduced to the language and be taught to program basic algorithmic procedures. The book expects no prior experience with programming, although we hope to provide some interesting insights and skills for more advanced programmers as well. The book consists of 10 chapters. Chapter 5 and Chapter 6 are still in draft status and not ready for use."
python  programming  tutorial  digital-humanities 
january 2018 by tsuomela
Matter & Interactions | Contemporary calculus-based physics
"Matter & Interactions is a textbook by Ruth Chabay and Bruce Sherwood (John Wiley & Sons, 4th edition, 2015) that emphasizes a modern perspective on the calculus-based introductory physics curriculum taken by science and engineering students. it engages students in: Starting analyses from fundamental principles rather than secondary formulas Making macro-micro connections, based on the atomic nature of matter Modeling physical systems: making idealizations, simplifying assumptions, estimates Constructing computational models to predict the time evolution of system behavior"
physics  textbook  python  graphics 
january 2017 by tsuomela
GlowScript IDE
"GlowScript is an easy-to-use, powerful environment for creating 3D animations and publishing them on the web. Here at, you can write and run GlowScript programs right in your browser, store them in the cloud for free, and easily share them with others. Thanks to the RapydScript compiler, you can use VPython here."
programming  library  python  graphics  physics  browser 
january 2017 by tsuomela
"VPython makes it easy to create navigable 3D displays and animations, even for those with limited programming experience. Because it is based on Python, it also has much to offer for experienced programmers and researchers."
programming  library  python  graphics  physics 
january 2017 by tsuomela
Python For The… by Gordon Webster et al. [PDF/iPad/Kindle]
"Python For The Life Sciences is an intuitive and easy-to-follow introduction to computer programming, written specifically for biologists with no prior experience of writing code. This is a full course in Python programming, taught using real biological applications. Your purchase includes downloads of all of the code examples in the book, to try them, learn from them, and even use or adapt them for your own research. See below for details of our academic discount."
book  publisher  programming  python  life-sciences  biology 
november 2016 by tsuomela PyCX Project
"The PyCX Project aims to develop an online repository of simple, crude, yet easy-to-understand Python sample codes for dynamic complex systems simulations, including iterative maps, cellular automata, dynamical networks and agent-based models."
python  agent-based-model  complexity  programming  library  simulation 
july 2016 by tsuomela
DataCamp: The Easy Way To Learn R & Data Science Online
"Master data analysis from the comfort of your browser, at your own pace, tailored to your needs and expertise. Whether you want to learn R, Python or Data Visualization, we want to help!"
programming  data  analysis  r  python  tutorials 
june 2016 by tsuomela
Project Jupyter | Home
"The Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, machine learning and much more."
programming  python  notebook  interactive 
june 2016 by tsuomela
Natural Language Toolkit — NLTK 3.0 documentation
"NLTK is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning, wrappers for industrial-strength NLP libraries, and an active discussion forum. "
python  library  language  analysis  text-analysis 
june 2016 by tsuomela
graph-tool: Efficent network analysis with python
"Graph-tool is an efficient Python module for manipulation and statistical analysis of graphs (a.k.a. networks). Contrary to most other python modules with similar functionality, the core data structures and algorithms are implemented in C++, making extensive use of template metaprogramming, based heavily on the Boost Graph Library. This confers it a level of performance that is comparable (both in memory usage and computation time) to that of a pure C/C++ library."
python  programming  graphs  library 
may 2016 by tsuomela
"Turn a GitHub repo into a collection of interactive notebooks powered by Jupyter and Kubernetes."
python  programming  notebook  github  integration 
march 2016 by tsuomela
Python Data Analysis Library — pandas: Python Data Analysis Library
"pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language."
python  data-exploration  data  statistics 
july 2014 by tsuomela
PyBossa is a free, open-source, platform for creating and running crowd-sourcing applications that utilise online assistance in performing tasks that require human cognition, knowledge or intelligence such as image classification, transcription, geocoding and more!
crowdsourcing  python  open-source  platform  programming  library 
july 2012 by tsuomela
Scrapy | An open source web scraping framework for Python
"Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing."
web-programming  programming  python  library  scraping  data-collection  online 
march 2011 by tsuomela
Sage: Open Source Mathematics Software
Sage is a free open-source mathematics software system licensed under the GPL. It combines the power of many existing open-source packages into a common Python-based interface.
mathematics  programming  computer  software  python  tools  science  open-source 
march 2010 by tsuomela
Overview — NetworkX v1.0rc1 documentation
NetworkX is a Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks.
programming  python  network-analysis  networks  graphs  libraries  visualization 
december 2009 by tsuomela
PDFMiner is a suite of programs that help extracting and analyzing text data of PDF documents. Unlike other PDF-related tools, it allows to obtain the exact location of texts in a page, as well as other extra information such as font information or ruled lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purpoes instead of text analysis.
software  python  pdf  library  tools  programming 
june 2009 by tsuomela
« earlier      
per page:    204080120160

Copy this bookmark:

to read