recentpopularlog in

skinnymuch : scraping   67

Premium Dedicated Proxies. Designed For High-Scope Data Mining
Starts at 100 proxies for $178 per month. 7 day trial available by contact.
paid  proxies  $AFMA_umbrella  scraping  trial 
april 2018 by skinnymuch
dredei/TwitterPhotoDownloader: Download all photo of a user from media page.
No clue if this does video too. Window app needing .Net Framework 4.6 and whatever Gecko is, is lame. Haven't tried with Wine.
twitter  APIs  hacking  scraping  rare  open_source  bookmarked_on_site  C#  windows  apps 
july 2017 by skinnymuch
SiteSucker for macOS
SiteSucker 2.11.1 - Automatically downloads
apps  dev  paid  K  mac  scraping  price:$5  app_stores 
june 2017 by skinnymuch
Is ‘Inside Out’ to Pixar what ‘Frozen’ is to Disney lively films? | World News - INTERNATIONAL HEADLINES
Not sure how I got to this page. I think from some Bing search result. It is horrible writing. Too weird to be done by a cheap non-native English copywrite. Def content spinning.

And it ranked..maybe not on Google, but even ranking on Bing and having me click means some people are visiting this site.

Plus, they have two small pics of the movies near the top. So some part is scraping too.
--
"The latest Pixar movie, Inside Out, is creating an uproar that is similar to the one that Frozen did last year (though not as crazy which could be that this mov"
content_spinning  markov  work  MFA  spam  copy  SEO  blackhat  examples  scraping  ideas 
july 2015 by skinnymuch
ESPN Insider Articles Free
Posting ESPN Insider articles on a Google Adsense ridden site with almost no design. And the scraping is badly done too. But still, def something people want. Found it via a one poster, likely the site owner, posting a link to it in a related forum discussion in early 2013. Site is still up as of late 2014. -- Bottom says: "Powered by Howtoknow.us"
content_scraping  scraping  ideas  copy  SEO  bluehat  sports  fantasy_sports  paywalls 
september 2014 by skinnymuch
import•io - Structured Web Data Scraping
"information from the web into useable data

Turn any website into a table of data or an API in minutes without writing any code"
SaaS  scraping 
march 2014 by skinnymuch
dwillis/feedbag · GitHub
"Feedbag is a Ruby library for the auto-discovery of syndicated feeds (RSS/Atom)."

_Give it a url and it'll try finding the feed for the site_
gems  ruby  open_source  parsing  scraping  automation  feeds 
may 2013 by skinnymuch
pauldix/feedzirra · GitHub
"A feed fetching and parsing library that treats the internet like Godzilla treats Japan: it dominates and eats all."
ruby  open_source  scraping  feeds  automation 
may 2013 by skinnymuch
ParseRSS | Full-Text Articles from RSS Feeds
"Retrieve full-text mobile-optimized JSON-encoded articles from a RSS feed"

_It uses the Streamified.me API. Not sure how much of the app is just a wrapper of that and how much is actually legit coding_
scraping  closed_source  simple  RSS  feeds  web_tools 
may 2013 by skinnymuch
All your news,updates and trends... In-A-Gist
"40+ channels on various subjects and latest trends top ranked stories from trusted sources based on popularity and quality sign in with twitter and choose your channels"

"In-A-Gist algorithmically curates tweets based on popularity in real-time. We collate tweets on the same topic and this page is built from such curated tweets. We keep refreshing this page as and when we find popular tweets on topics mentioned in the tweet. They are presented in the "Related Tweets" section."
twitter  API  scraping  idea 
march 2013 by skinnymuch
john-griffin/mechanize-content · GitHub
"Returns the most important pieces of content on a web page. Finds the best block of text, image and title by analysing the page content."
readability  parse  scraping 
march 2013 by skinnymuch
2092202414 / 209-220-2414 / 209 220 2414
This site is just scraping http://800notes.com which always seems to be at the top of Google rankings for phone number searches
blackhat  automation  scraping  steal  content  web  2.0  user  generated 
july 2012 by skinnymuch
WatirMelon | A 93% Watir Based Blog by Alister Scott
Claims to be whitehat and that is what we see, but this blog is filled with not so whitehat gold [for a non-blackhat[er] [Ruby] [programmer]]
whitehat  masquerading  blackhat  automation  screen  scraping  testing  automated  generate  data  blog  watir 
june 2012 by skinnymuch

Copy this bookmark:





to read