爱美女网爬虫[预览版] [23.09.09] [Windows]

Plain text
Copy to clipboard
Open code in new window
EnlighterJS 3 Syntax Highlighter
更新日志:
1.处理服务器404错误,不会中断下载
2.替换默认服务器地址
3.自动删除无法下载的空文件夹
更新日志: 1.处理服务器404错误,不会中断下载 2.替换默认服务器地址 3.自动删除无法下载的空文件夹
更新日志:
1.处理服务器404错误,不会中断下载
2.替换默认服务器地址
3.自动删除无法下载的空文件夹

Continue Reading

HTTrack Website Copier [网站下载器]

今天需要下载一个静态页面的网站,本来想直接保存html的结果看了一下页面贼多,于是果断放弃了。找工具进行处理,搜索了一下找到了这个开源免费的。用了一下效果还不错。

HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility.

It allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer. HTTrack arranges the original site’s relative link-structure. Simply open a page of the “mirrored” website in your browser, and you can browse the site from link to link, as if you were viewing it online. HTTrack can also update an existing mirrored site, and resume interrupted downloads. HTTrack is fully configurable, and has an integrated help system.

WinHTTrack is the Windows (from Windows 2000 to Windows 10 and above) release of HTTrack, and WebHTTrack the Linux/Unix/BSD release. See the download page.

Continue Reading

精品美女吧 爬虫【Windows】【23.04.16】

Plain text
Copy to clipboard
Open code in new window
EnlighterJS 3 Syntax Highlighter
精品美女吧 爬虫
Verson: 23.04.16
Blog: http://www.h4ck.org.cn
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search> -e <early stop>
Arguments:
-a <download all site images>
-h <display help text, just this>
Option Arguments:
-p <image download path>
-r <random index category list>
-c <single category url>
-e <early stop, work in site crawl mode only>
-s <site url eg: https://www.jpxgmn.net (no last backslash "/")>
****************************************************************************************************
精品美女吧 爬虫 Verson: 23.04.16 Blog: http://www.h4ck.org.cn **************************************************************************************************** USAGE: spider -h <help> -a <all> -q <search> -e <early stop> Arguments: -a <download all site images> -h <display help text, just this> Option Arguments: -p <image download path> -r <random index category list> -c <single category url> -e <early stop, work in site crawl mode only> -s <site url eg: https://www.jpxgmn.net (no last backslash "/")> ****************************************************************************************************
精品美女吧 爬虫
Verson: 23.04.16
Blog: http://www.h4ck.org.cn
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search> -e <early stop>
Arguments:
         -a <download all site images>
         -h <display help text, just this>
Option Arguments:
         -p <image download path>
         -r <random index category list>
         -c <single category url>
         -e <early stop, work in site crawl mode only>
         -s <site url eg: https://www.jpxgmn.net (no last backslash "/")>
****************************************************************************************************

Continue Reading

爱看美女网爬虫【Windows】【23.03.02】

Plain text
Copy to clipboard
Open code in new window
EnlighterJS 3 Syntax Highlighter
C:\Users\obaby>F:\Pycharm_Projects\sexy_girl_spider\dist\ikmn\ikmn.exe
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search> -e <early stop>
Arguments:
-a <download all site images>
-q <query the image with keywords>
-h <display help text, just this>
Option Arguments:
-p <image download path>
-r <random index category list>
-c <single category url>
-e <early stop, work in site crawl mode only>
-s <site url eg: https://www.ikmn.vip (no last backslash "/")>
****************************************************************************************************
C:\Users\obaby>F:\Pycharm_Projects\sexy_girl_spider\dist\ikmn\ikmn.exe **************************************************************************************************** USAGE: spider -h <help> -a <all> -q <search> -e <early stop> Arguments: -a <download all site images> -q <query the image with keywords> -h <display help text, just this> Option Arguments: -p <image download path> -r <random index category list> -c <single category url> -e <early stop, work in site crawl mode only> -s <site url eg: https://www.ikmn.vip (no last backslash "/")> ****************************************************************************************************
C:\Users\obaby>F:\Pycharm_Projects\sexy_girl_spider\dist\ikmn\ikmn.exe
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search> -e <early stop>
Arguments:
         -a <download all site images>
         -q <query the image with keywords>
         -h <display help text, just this>
Option Arguments:
         -p <image download path>
         -r <random index category list>
         -c <single category url>
         -e <early stop, work in site crawl mode only>
         -s <site url eg: https://www.ikmn.vip (no last backslash "/")>
****************************************************************************************************

Continue Reading

精品美女吧 爬虫【Windows】【22.12.23】

Plain text
Copy to clipboard
Open code in new window
EnlighterJS 3 Syntax Highlighter
精品美女吧 爬虫
Verson: 22.12.23
Blog: http://www.h4ck.org.cn
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search> -e <early stop>
Arguments:
-a <download all site images>
-q <query the image with keywords>
-h <display help text, just this>
****************************************************************************************************
精品美女吧 爬虫 Verson: 22.12.23 Blog: http://www.h4ck.org.cn **************************************************************************************************** USAGE: spider -h <help> -a <all> -q <search> -e <early stop> Arguments: -a <download all site images> -q <query the image with keywords> -h <display help text, just this> ****************************************************************************************************
精品美女吧 爬虫
Verson: 22.12.23
Blog: http://www.h4ck.org.cn
****************************************************************************************************
USAGE:
spider -h <help> -a <all> -q <search> -e <early stop>
Arguments:
         -a <download all site images>
         -q <query the image with keywords>
         -h <display help text, just this>
****************************************************************************************************

Continue Reading

秀人美女网爬虫 【Windows】【22.12.09】

Plain text
Copy to clipboard
Open code in new window
EnlighterJS 3 Syntax Highlighter
更新日志:
调整超时时间为30秒,解决由于服务器解析导致的下载失败。如果效果不好,请下载旧版本。
更新日志: 调整超时时间为30秒,解决由于服务器解析导致的下载失败。如果效果不好,请下载旧版本。
更新日志:
调整超时时间为30秒,解决由于服务器解析导致的下载失败。如果效果不好,请下载旧版本。

Continue Reading