recentpopularlog in

aries1988 : calibre   27

Ebook Conversion — calibre User Manual
Directory Description
input This contains the HTML output by the Input Plugin. Use this to debug the Input Plugin.
parsed The result of pre-processing and converting to XHTML the output from the Input Plugin. Use to debug structure detection.
structure Post structure detection, but before CSS flattening and font size conversion. Use to debug font size conversion and CSS transforms.
processed Just before the ebook is passed to the output plugin. Use to debug the Output Plugin.
calibre  tool  ebook 
january 2017 by aries1988
Frequently Asked Questions — calibre User Manual
Note If you are concerned about giving calibre access to your email account, simply create a new free email account with GMX or Hotmail and use it only for calibre.
calibre  email 
april 2016 by aries1988
python - Remove a tag using BeautifulSoup but keep its contents - Stack Overflow
html = "<p>Good, <b>bad</b>, and <i>ug<b>l</b><u>y</u></i></p>"
invalid_tags = ['b', 'i', 'u']
print strip_tags(html, invalid_tags)
The result is:

<p>Good, bad, and ugly</p>
beautifulsoup  calibre  example 
march 2013 by aries1988
Ebook Conversion — calibre User Manual
input This contains the HTML output by the Input Plugin. Use this to debug the Input Plugin.
parsed The result of pre-processing and converting to XHTML the output from the Input Plugin. Use to debug structure detection.
structure Post structure detection, but before CSS flattening and font size conversion. Use to debug font size conversion and CSS transforms.
processed Just before the ebook is passed to the output plugin. Use to debug the Output Plugin.
calibre  ebook  explained 
march 2013 by aries1988
Adding your favorite news website — calibre User Manual
Remove articles in feeds based on a string in the article title or url.
calibre  ebook  kindle  news  rss  learn  explained  example 
march 2013 by aries1988
Blogspot Feeds not retrieved by Calibre - MobileRead Forums
4. The only case where you can mix the feeds and get a good result is where feed itself contains complete article text. My example mixes two feeds that exhibit such behavior and that is why it works. Pay attention to the line in recipe

Code:
use_embedded_content = True
That instructs calibre to get the article text from the feed and not to open the URL in the feed.
calibre  learn  rss  kindle 
march 2013 by aries1988
How Do I Get Rid of the DRM on My Ebooks and Video?
How Do I Get Rid of the DRM on My Ebooks and Video?
howto  calibre  ebook  pkm 
october 2012 by aries1988
Beautiful Soup documentation
Beautiful Soup 是用Python写的一个HTML/XML的解析器,它可以很好的处理不规范标记并生成剖析树(parse tree)。 它提供简单又常用的导航(navigating),搜索以及修改剖析树的操作。它可以大大节省你的编程时间。 对于Ruby,使用Rubyful Soup。

这个文档说明了Beautiful Soup 3.0主要的功能特性,并附有例子。 从中你可以知道这个库有哪些好处,它是怎样工作的, 怎样让它帮做你想做的事以及你该怎样做当它做的和你期待不一样。
doc  python  calibre  recipe  ebook 
june 2012 by aries1988
recipe Guide_advanced – calibre
Advanced Customization Tips
1. Needing authentication?
2. Customizing feed parsing
3. Content from blog's feeds: fiddling with the article URLS
4. Getting obfuscated content
5. Parsing an index from web pages: a taste of soup
6. Customising styles on your side
7. 'BeautifulSoup'. How to slice and dice your fetched content
7.1 Semi-automated slicing
7.2 Customizing your slicing
8. Handling 'paged' content
9. Dealing with bad HTML markup
10. Extracting, modifying and adding content to page
10.1 Extracting text from a webpage
10.2 Translate a string via Google Translate
10.3 Adding additional content to a webpage
11. What else can you do with recipes?
calibre  howto  ebook  mobi 
june 2012 by aries1988
Beautiful Soup documentation
Beautiful Soup 是用Python写的一个HTML/XML的解析器,它可以很好的处理不规范标记并生成剖析树(parse tree)。 它提供简单又常用的导航(navigating),搜索以及修改剖析树的操作。它可以大大节省你的编程时间。 对于Ruby,使用Rubyful Soup。
beautifulsoup  html  python  calibre 
october 2011 by aries1988
API Documentation for recipes — calibre User Manual
The API for writing recipes is defined by the BasicNewsRecipe
abort_recipe_processing(msg)[source]
add_toc_thumbnail(article, src)[source]
classmethod adeify_images(soup)[source]
cleanup()[source]
clone_browser(br)[source]
download()[source]
extract_readable_article(html, url)[source]
get_article_url(article)[source]
get_browser(*args, **kwargs)[source]
get_cover_url()[source]
get_feeds()[source]
get_masthead_title()[source]
get_masthead_url()[source]
get_obfuscated_article(url)[source]
classmethod image_url_processor(baseurl, url)[source]
is_link_wanted(url, tag)[source]
javascript_login(browser, username, password)[source]
parse_feeds()[source]
parse_index()[source]
populate_article_metadata(article, soup, first)[source]
postprocess_book(oeb, opts, log)[source]
postprocess_html(soup, first_fetch)[source]
preprocess_html(soup)[source]
preprocess_raw_html(raw_html, url)[source]
classmethod print_version(url)[source]
skip_ad_pages(soup)[source]
sort_index_by(index, weights)[source]

articles_are_obfuscated
auto_cleanup
auto_cleanup_keep
center_navbar
conversion_options
cover_margins
delay
description
encoding
extra_css
feeds
filter_regexps
ignore_duplicate_articles
keep_only_tags
language
masthead_url
match_regexps
max_articles_per_feed
needs_subscription
no_stylesheets
oldest_article
preprocess_regexps
publication_type
recipe_disabled
recursions
remove_attributes
remove_empty_feeds
remove_javascript
remove_tags
remove_tags_after
remove_tags_before
requires_version
reverse_article_order
simultaneous_downloads
summary_length
template_css
timefmt
timeout
title
use_embedded_content
use_javascript_to_login
recipe  calibre  learn  explained 
october 2011 by aries1988

Copy this bookmark:





to read