Files
pyShelf/docs/Book Scraping System.html
2023-03-20 12:12:55 -04:00

62 lines
1.9 KiB
HTML

<!DOCTYPE html>
<html>
<head>
<link rel="Stylesheet" type="text/css" href="style.css">
<link rel="alternate" type="application/rss+xml" title="RSS" href="rss.xml">
<title>Book Scraping System</title>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
</head>
<body>
<p>
<a href="index.html">Home</a> <a href="TOC.html">TOC</a> <a href="https://github.com/th3r00t/pyShelf.git">github</a>
</p>
<div id="Book Scraping System"><h1 id="Book Scraping System" class="header"><a href="#Book Scraping System">Book Scraping System</a></h1></div>
<div id="Book Scraping System-Introduction"><h2 id="Introduction" class="header"><a href="#Book Scraping System-Introduction">Introduction</a></h2></div>
<p>
The Book Scraping System is a custom ebook scraper built from the ground up
for pyShelf. It is designed to be a simple, easy to use, and easy to extend
system for scraping ebooks from the internet. It is powered by BeutifulSoup.
</p>
<p>
pyShelf features a recursive scraping algorithm that itterates over all
sub-folders in your collection.
</p>
<div id="Book Scraping System-Scrapers"><h2 id="Scrapers" class="header"><a href="#Book Scraping System-Scrapers">Scrapers</a></h2></div>
<ul>
<li class="done4">
epub #2899a8e9
<li class="done4">
mobi #ec035720
<li class="done4">
pdf #05875e64
<li class="done0">
cbz #4a513e39
</ul>
<div id="Book Scraping System-Collections"><h2 id="Collections" class="header"><a href="#Book Scraping System-Collections">Collections</a></h2></div>
<ul>
<li class="done0">
Manual Collections #b07156f4
<li class="done0">
<a href="Automated Collections.html">Automated Collections</a> #f258c1f8
</ul>
<div id="Book Scraping System-State"><h2 id="State" class="header"><a href="#Book Scraping System-State">State</a></h2></div>
<p>
Fully functional.
Awaiting further feature implementation.
</p>
</body>
</html>