From: Katherine Cox-Buday
To: Efraim Flashner
Cc: 41429@debbugs.gnu.org
Subject: Re: bug#41429: Shepherd Sometimes Crashes
Date: Thu, 21 May 2020 07:51:54 -0500

Efraim Flashner writes:

> On Wed, May 20, 2020 at 09:59:03PM -0500, Katherine Cox-Buday wrote:
>> I am running shepherd as a userspace service manager on an alien distro.
>> Occasionally (often enough as to cause concern), Shepherd is crashing.
>> I am unable to narrow down a cause, but anecdotally, it seems to happen
>> more often when a service it's managing fails repeatedly and is
>> disabled.
>>
>> I'm running `strace` against the Shepherd process in an attempt to
>> submit a better bug report, but this is all I have for now. Maybe others
>> have also seen this behavior.
>
> I found it happens less often with shepherd-0.8. What version are you
> running?
> Also possibly related, do you have mismatched versions of guile
> between guix packages and your distro's native packages?

Sorry, I forgot to include the version! I am running 0.8 from a store
which I update roughly once a week.

> I've also sometimes found shepherd to crash when I add a service where
> the start command is "wrong", as though the error were so bad that
> shepherd says "Nope! That's it! I quit!"

I'm doing very standard things with `make-forkexec-constructor`, so I
wouldn't expect any problems there.

Your comment is kind of scary though! Shepherd is the thing I want to
stay up no matter what, since it's responsible for monitoring and
restarting things. The idea that a misbehaving or poorly written service
could bring down the entire Shepherd process is a problem! Is there no
isolation?

> I'd suggest looking at .config/shepherd/shepherd.log but it's rather
> sparse. Still, it might have something useful.

Yes, this is the first place I looked, but unfortunately there wasn't
much usable information.

--
Katherine