[SQU] Squid was dead.

From: Y15(Linux) y.tagashira <taga@dont-contact.us>
Date: Thu, 07 Dec 2000 17:15:26 +0900

During the long time operation of squid under the high load condition,
Squid was dead.

I want to know a cause.

Please Any One Support Me.

//Select Loop failed!//
src/main.c main()内 (695line)

        if ((loop_delay = eventNextTime()) < 0)
            loop_delay = 0;
#if HAVE_POLL
        switch (comm_poll(loop_delay)) {
#else
        switch (comm_select(loop_delay)) {
#endif
        case COMM_OK:
            errcount = 0; /* reset if successful */
            break;
        case COMM_ERROR: -+
            errcount++; |
            debug(1, 0) ("Select loop Error. Retry %d\n", errcount);| Here
            if (errcount == 10) |
                fatal_dump("Select Loop failed!"); |
            break; -+
        ...
        ...

//recvmsg bug: copied//
<net/ipv4/tcp.c> (1159line)
int tcp_recvmsg(struct sock *sk, struct msghdr *msg,
                int len, int nonblock, int flags, int *addr_len)
{
        struct tcp_opt *tp = &(sk->tp_pinfo.af_tcp);
        struct wait_queue wait = { current, NULL };
        int copied = 0;
        u32 peek_seq;
        volatile u32 *seq; /* So gcc doesn't overoptimise */
...
...
        skb = skb_peek(&sk->receive_queue);
        do {
             if (!skb)
             break;
             /* Now that we have two receive queues this
              * shouldn't happen.
              */
             if (before(*seq, TCP_SKB_CB(skb)->seq)) { -+
                 printk(KERN_INFO "recvmsg bug: copied %X seq %X\ n",|
                        *seq, TCP_SKB_CB(skb)->seq); |Here
                 break; |
             } -+
...
...

TurboLinux 6.1J Linux2.2.15
/var/log/messages
--------------------------
Nov 27 18:14:04 peach (squid): Select Loop failed!
Nov 27 18:14:04 peach squid[396]: Squid Parent: child process 398 exited due to signal 6
Nov 27 18:14:07 peach squid[396]: Squid Parent: child process 749 started
Nov 27 18:18:36 peach (squid): Select Loop failed!
Nov 27 18:18:36 peach squid[396]: Squid Parent: child process 749 exited due to signal 6
Nov 27 18:18:39 peach squid[396]: Squid Parent: child process 770 started
Nov 27 18:23:42 peach kernel: recvmsg bug: copied 6159AABD seq 6159B065
Nov 27 18:24:14 peach last message repeated 11 times

--------------------------
Nov 28 00:03:57 peach kernel: recvmsg bug: copied 2539EEAC seq 2539F454
Nov 28 00:03:57 peach kernel: recvmsg bug: copied 2539EEAC seq 2539F454
Nov 28 00:03:57 peach kernel: recvmsg bug: copied 25D02D5D seq 25D03305
Nov 28 00:03:57 peach kernel: recvmsg bug: copied 2539EEAC seq 2539F454
Nov 28 00:03:58 peach kernel: recvmsg bug: copied 25D02D5D seq 25D03305
Nov 28 00:03:58 peach kernel: recvmsg bug: copied 2539EEAC seq 2539F454
Nov 28 00:04:04 peach last message repeated 18 times
Nov 28 00:04:04 peach kernel: recvmsg bug: copied 283704EB seq 28370A93

-omission-

Nov 28 00:37:48 peach kernel: recvmsg bug: copied 7305CA11 seq 7305CFB9
Nov 28 00:37:48 peach kernel: recvmsg bug: copied 992B042B seq 992B09D3
Nov 28 00:37:48 peach (squid): Select Loop failed!
Nov 28 00:37:49 peach squid[516]: Squid Parent: child process 1102 exited due to signal 6
Nov 28 00:37:52 peach squid[516]: Squid Parent: child process 1254 started
Nov 28 01:04:02 peach kernel: recvmsg bug: copied 9536CAD seq 9537255

----------------------
Nov 29 19:30:30 peach (squid): Select Loop failed!
Nov 29 19:30:31 peach squid[406]: Squid Parent: child process 407 exited due to signal 6
Nov 29 19:30:34 peach squid[406]: Squid Parent: child process 2571 started
Nov 29 19:42:01 peach (squid): Select Loop failed!
Nov 29 19:42:01 peach squid[406]: Squid Parent: child process 2571 exited due to signal 6
Nov 29 19:42:04 peach squid[406]: Squid Parent: child process 2605 started
Nov 29 19:44:15 peach kernel: recvmsg bug: copied CDD61A30 seq CDD61FD8
Nov 29 19:44:15 peach kernel: recvmsg bug: copied CDD01B76 seq CDD0211E
Nov 29 19:44:15 peach kernel: recvmsg bug: copied CDD61A30 seq CDD61FD8

-omission-

Nov 29 19:52:01 peach kernel: recvmsg bug: copied CDD61A30 seq CDD61FD8
Nov 29 19:52:01 peach kernel: recvmsg bug: copied CDD01B76 seq CDD0211E
Nov 29 19:52:01 peach (squid): Select Loop failed!
Nov 29 19:52:01 peach squid[406]: Squid Parent: child process 2605 exited due to signal 6
Nov 29 19:52:04 peach squid[406]: Squid Parent: child process 2637 started

-------------------------------------------------
- Red Hat Linux Certified Engineer(RHCE) -
- Yoshito Tagashira -
-------------------------------------------------
 taga@fsi.co.jp -
- Yokohama Japan -
-------------------------------------------------

--
To unsubscribe, see http://www.squid-cache.org/mailing-lists.html
Received on Thu Dec 07 2000 - 01:19:30 MST

This archive was generated by hypermail pre-2.1.9 : Tue Dec 09 2003 - 16:56:52 MST