bruno

"mr storylets"

writer (derogatory). lead designer on Fallen London.

http://twitter.com/notbrunoagain


THESE POSTS ARE PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE POSTS OR THE USE OR OTHER DEALINGS IN THE POSTS.


Bluesky
brunodias.bsky.social
You must log in to comment.

in reply to @bruno's post:

in reply to @bruno's post:

Depends on what you need I think. If it's just text then dropping a URL into wget will probably do it. If it's everything then you're going to have more of a challenge. As the other comment said printing is straightforward and you get what you see. You could also try recursive wget with a fake user agent, although that will still be incomplete for most new sites.

If you're looking to also grab transitively-linked image, JS, and CSS files then the best/easiest option might be a crawler. It's hitting an ant with a sledgehammer, but there's a menagerie of open-source web crawlers out there in various languages.

Depending on the size of the work involved, another option would be getting a packet sniffer like Charles Proxy or Wireshark, and manually go through the game with session recording enabled. Extracting the work from the saved session is an exercise left to the reader.