Bird 2.0.9 Crash when 16K routes sent from our App to Bird module

Ondrej Zajicek santiago at crfreenet.org
Tue Sep 27 14:57:28 CEST 2022


On Fri, Sep 23, 2022 at 02:38:59PM +0000, mukund via Bird-users wrote:
>  
>  Hi,
> We are using bird version 2.0.9.  We have IXIA traffic which is sending 16K OSPF  routes to our App which we send to Bird. Issue is seen when we restart our App. As part of our restart app handling, we restart Bird as well.We see a crash  with following BT
> VT33-VT33_PRI:~/backup/cores/coredump_2022-09-20.20.31.33# gdb bird core.bird.30975GNU gdb (GDB) Red Hat Enterprise Linux 7.6.1-120.0.1.el7Copyright (C) 2013 Free Software Foundation, Inc.License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>This is free software: you are free to change and redistribute it.There is NO WARRANTY, to the extent permitted by law.  Type "show copying"and "show warranty" for details.This GDB was configured as "x86_64-redhat-linux-gnu".For bug reporting instructions, please see:<http://www.gnu.org/software/gdb/bugs/>...Reading symbols from /home/talariuser/backup/cores/coredump_2022-09-20.20.31.33/bird...done.[New LWP 30975][Thread debugging using libthread_db enabled]Using host libthread_db library "/lib64/libthread_db.so.1".Core was generated by `/home/talariuser/bird/sbin/bird -f'.Program terminated with signal 8, Arithmetic exception.#0  0x000000000047262e in sl_free (s=0x783170, oo=0x7ffff6e1fc60) at /tn-build/src/third_party/bird/lib/slab.c:315315     /tn-build/src/third_party/bird/lib/slab.c: No such file or directory.Missing separate debuginfos, use: debuginfo-install glibc-2.17-325.0.1.el7_9.x86_64 openssl-libs-1.0.2k-22.el7_9.x86_64 zlib-1.2.7-19.el7_9.x86_64(gdb) bt#0  0x000000000047262e in sl_free (s=0x783170, oo=0x7ffff6e1fc60) at /tn-build/src/third_party/bird/lib/slab.c:315#1  0x0000000000462e33 in fib_delete (f=0x75f508, E=0x7ffff6e1fc60) at /tn-build/src/third_party/bird/nest/rt-fib.c:479#2  0x0000000000420915 in rt_sync (p=0x75f350) at /tn-build/src/third_party/bird/proto/ospf/rt.c:2104#3  0x00000000004218da in ospf_rt_spf.part.31 (p=0x75f350) at /tn-build/src/third_party/bird/proto/ospf/rt.c:1721#4  0x000000000041f403 in ospf_rt_spf (p=0x75f350) at /tn-build/src/third_party/bird/proto/ospf/rt.c:1692#5  0x000000000042fb8f in ospf_disp (timer=0x782f10) at /tn-build/src/third_party/bird/proto/ospf/ospf.c:482#6  0x00000000004736cf in timers_fire (loop=0x74e0e0 <main_timeloop>) at /tn-build/src/third_party/bird/lib/timer.c:235#7  0x0000000000408d0a in io_loop () at /tn-build/src/third_party/bird/sysdep/unix/io.c:2260#8  0x0000000000405af6 in main (argc=2, argv=0x7fffffffeb68) at /tn-build/src/third_party/bird/sysdep/unix/main.c:952
> Since all routes are newly passed and OSPF SPF calculation is done from scratch, not sure why delete / free has an issue.
> Also issue is not seen when there are less routes say ~200. Seems to be specific to huge number.If any pointers to this will be helpful
> Thanks in advance .

Hi

Could you send us core dump and bird binary? Also could you try 2.0.10?

-- 
Elen sila lumenn' omentielvo

Ondrej 'Santiago' Zajicek (email: santiago at crfreenet.org)
OpenPGP encrypted e-mails preferred (KeyID 0x11DEADC3, wwwkeys.pgp.net)
"To err is human -- to blame it on a computer is even more so."



More information about the Bird-users mailing list