Files
pyShelf/docs/Book Scraping System.html
2023-03-17 19:46:25 -04:00

62 lines
1.9 KiB
HTML

<!DOCTYPE html>
<html>
<head>
<link rel="Stylesheet" type="text/css" href="style.css">
<link rel="alternate" type="application/rss+xml" title="RSS" href="rss.xml">
<title>Book Scraping System</title>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
</head>
<body>
<p>
<a href="index.html">Home</a> <a href="TOC.html">TOC</a> <a href="https://github.com/th3r00t/pyShelf.git">github</a>
</p>
<div id="Book Scraping System"><h1 id="Book Scraping System" class="header"><a href="#Book Scraping System">Book Scraping System</a></h1></div>
<div id="Book Scraping System-Introduction"><h2 id="Introduction" class="header"><a href="#Book Scraping System-Introduction">Introduction</a></h2></div>
<p>
The Book Scraping System is a custom ebook scraper built from the ground up
for pyShelf. It is designed to be a simple, easy to use, and easy to extend
system for scraping ebooks from the internet. It is powered by BeutifulSoup.
</p>
<p>
pyShelf features a recursive scraping algorithm that itterates over all
sub-folders in your collection.
</p>
<div id="Book Scraping System-Scrapers"><h2 id="Scrapers" class="header"><a href="#Book Scraping System-Scrapers">Scrapers</a></h2></div>
<ul>
<li class="done4">
epub #75f00edf
<li class="done4">
mobi #2fe4b161
<li class="done4">
pdf #ffa7e7f0
<li class="done0">
cbz #64b5da95
</ul>
<div id="Book Scraping System-Collections"><h2 id="Collections" class="header"><a href="#Book Scraping System-Collections">Collections</a></h2></div>
<ul>
<li class="done0">
Manual Collections #2e7e6fcf
<li class="done4">
Automated Collections #81db675a
</ul>
<div id="Book Scraping System-State"><h2 id="State" class="header"><a href="#Book Scraping System-State">State</a></h2></div>
<p>
Fully functional.
Awaiting further feature implementation.
</p>
</body>
</html>