Skip to content
Snippets Groups Projects
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-867288's avatar
pushed to branch master at dlab-indecol / web_trawler
  • fcb69ccb · remove some readme spelling
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
  • ff20fb04 · running regardless of __name__
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
  • 277eb7f4 · logging to file, added __main__
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-1390245's avatar
pushed to branch master at dlab-indecol / moose
username-removed-871665's avatar
opened merge request !1 "removing .idea pycharm stuff and creating gitignore file" at dlab-indecol / moose
username-removed-871665's avatar
pushed new branch gitignore at dlab-indecol / moose
username-removed-871665's avatar
created project dlab-indecol / moose
username-removed-867288's avatar
commented on issue #3 "web_trawler should be able to trawl all web pages in a domain, especially now that there's a y/n prompt" at dlab-indecol / web_trawler
but we can still have the option non-interactive, just trawl all first/second order links...
username-removed-1390245's avatar
commented on issue #3 "web_trawler should be able to trawl all web pages in a domain, especially now that there's a y/n prompt" at dlab-indecol / web_trawler

Non-interactive mode with regard to the file links should still be default, I agree. But if you'd like the trawler to jump to linked web pages in...

username-removed-1390245's avatar
commented on issue #4 "Make robust for "too many connections" errors" at dlab-indecol / web_trawler

Web_trawler already uses both multiprocessing and multithreading. I think all that's needed is to catch certain kinds of errors, time.sleep for a w...