Re: FIN_WAIT_[1,2] and LAST_ACK

From: Brandon Erhart (berhart_at_ErhartGroup.COM)
Date: 04/05/04

  • Next message: Chuck Swiger: "Re: FIN_WAIT_[1,2] and LAST_ACK"
    Date: Sun, 04 Apr 2004 16:07:38 -0600
    To: Chuck Swiger <cswiger@mac.com>
    
    

    Yes, it pays attention to /robots.txt.
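
For readers wondering what "paying attention to /robots.txt" can look like in a crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. It is not the code from my tool; the user-agent name `example-archiver` and the sample rules are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical crawler name, used only for this illustration.
USER_AGENT = "example-archiver"

# Made-up robots.txt content; a real crawler would download it from the site.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt):
    """Parse the text of a robots.txt file that was already fetched."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def allowed(parser, url):
    """Return True if USER_AGENT may fetch the given URL."""
    return parser.can_fetch(USER_AGENT, url)
```

A crawler would call `allowed()` before queueing each URL and skip anything that returns False.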

    But I am writing my own; I don't want to use rsync, wget, or anything like
    that. This is part of an archiving project, and it uses so many FDs because
    it has tons of connections open to DIFFERENT servers at different times,
    not just to one site.

    Any advice on the timeouts? I don't really care about the RFC, honestly
    :-P. Like I said, I'm going for sheer speed.
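
One application-level workaround that is sometimes used instead of tuning kernel timeouts is an abortive close: setting SO_LINGER with a zero linger time, so that close() typically sends a RST rather than a FIN and the socket does not wait around in the FIN_WAIT or TIME_WAIT states. This is a different technique from the timeout tuning asked about above, and only a sketch: the helper names are made up, any unsent data is discarded, and the peer sees a connection reset.

```python
import socket
import struct


def enable_abortive_close(sock):
    """Set SO_LINGER to on with a zero timeout (struct linger: l_onoff=1, l_linger=0)."""
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_LINGER, struct.pack("ii", 1, 0))


def abortive_close(sock):
    """Close the socket so that the kernel resets the connection instead of lingering.

    Only call this after the full response has been read; anything still
    queued for sending is thrown away.
    """
    enable_abortive_close(sock)
    sock.close()
```

This trades protocol correctness for speed, so it only makes sense where a reset connection is acceptable to both ends.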

    Brandon

    At 04:02 PM 4/4/2004, you wrote:
    >Brandon Erhart wrote:
    >>I am writing a network application that mirrors a given website (such as
    >>a souped-up "wget"). I use a lot of FDs, and was getting connect() errors
    >>when I would run out of local_ip:local_port tuples. I lowered the MSL so
    >>that TIME_WAIT would time out very quickly (yes, I know, this is "bad", but
    >>I'm going for sheer speed here), and it alleviated the problem a bit.
    >>However, I have run into a new problem. I am getting a good number of
    >>blocks stuck in FIN_WAIT_1, FIN_WAIT_2 or LAST_ACK that stick around for
    >>a long while. I have been unable to find much information on a timeout
    >>for these states.
    >
    >Well, these are defined in RFC 793 (aka STD 7).
    >
    >If you want to mirror the content of a given website rapidly, a good
    >approach would be to use a tool like rsync and duplicate the changed
    >portions at the filesystem level rather than mirroring via HTTP requests.
    >
    >Using HTTP/1.1 pipelining should also greatly reduce the number of new
    >connections you need to open, which ought to speed up your program
    >significantly while reducing load on the servers you're mirroring.
    >
    >Since I've given some helpful advice (or so I think :-), perhaps you'll be
    >willing to listen to a word of caution: if your client is pushing so hard
    >that it exhausts the local machine's resources, you're very probably doing
    >something that reasonable website administrators would consider to be
    >abusive and you may cause denial-of-service conditions for other users of
    >that site.
    >
    >Does your tool pay attention to /robots.txt?
    >
    >--
    >-Chuck
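
The suggestion in the quoted message about reducing the number of new connections can be illustrated with a small sketch. Note that this shows HTTP/1.1 persistent (keep-alive) connections, where several requests are sent one after another over a single TCP connection, not true pipelining, which Python's standard `http.client` does not do. The function name `fetch_many` is made up for this example.

```python
import http.client


def fetch_many(host, port, paths):
    """Fetch several paths from one server over a single reused connection.

    Returns a list of (status, body) tuples in request order.
    """
    conn = http.client.HTTPConnection(host, port, timeout=10)
    results = []
    try:
        for path in paths:
            conn.request("GET", path)
            resp = conn.getresponse()
            # The body must be read completely before the connection
            # can carry the next request.
            results.append((resp.status, resp.read()))
    finally:
        conn.close()
    return results
```

With this approach each server costs one local port and one eventual TIME_WAIT entry per batch of requests, rather than one per file, provided the server keeps the connection open.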

    _______________________________________________
    freebsd-net@freebsd.org mailing list
    http://lists.freebsd.org/mailman/listinfo/freebsd-net
    To unsubscribe, send any mail to "freebsd-net-unsubscribe@freebsd.org"

