Bird 2.0.9 Crash when 16K routes sent from our App to Bird module

Maria Matejka maria.matejka at nic.cz
Fri Sep 23 17:02:22 CEST 2022


Hello!

Could you please provide a description how to replicate this bug without 
any external app? It's quite uncommon to receive 16k routes by OSPF so 
we don't test such scenarios and by your report, I can imagine lots of 
variants to check.

Thanks
Maria

On 9/23/22 16:38, mukund via Bird-users wrote:
> 
> Hi,
> 
> We are using bird version 2.0.9.  We have IXIA traffic which is sending 
> 16K OSPF  routes to our App which
> we send to Bird. Issue is seen when we restart our App. As part of our 
> restart app handling, we restart Bird as well.
> We see a crash  with following BT
> 
> VT33-VT33_PRI:~/backup/cores/coredump_2022-09-20.20.31.33# gdb bird 
> core.bird.30975
> GNU gdb (GDB) Red Hat Enterprise Linux 7.6.1-120.0.1.el7
> Copyright (C) 2013 Free Software Foundation, Inc.
> License GPLv3+: GNU GPL version 3 or later 
> <http://gnu.org/licenses/gpl.html>
> This is free software: you are free to change and redistribute it.
> There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
> and "show warranty" for details.
> This GDB was configured as "x86_64-redhat-linux-gnu".
> For bug reporting instructions, please see:
> <http://www.gnu.org/software/gdb/bugs/>...
> Reading symbols from 
> /home/talariuser/backup/cores/coredump_2022-09-20.20.31.33/bird...done.
> [New LWP 30975]
> [Thread debugging using libthread_db enabled]
> Using host libthread_db library "/lib64/libthread_db.so.1".
> Core was generated by `/home/talariuser/bird/sbin/bird -f'.
> Program terminated with signal 8, Arithmetic exception.
> *#0  0x000000000047262e in sl_free (s=0x783170, oo=0x7ffff6e1fc60) at 
> /tn-build/src/third_party/bird/lib/slab.c:315*
> 315     /tn-build/src/third_party/bird/lib/slab.c: No such file or 
> directory.
> Missing separate debuginfos, use: debuginfo-install 
> glibc-2.17-325.0.1.el7_9.x86_64 openssl-libs-1.0.2k-22.el7_9.x86_64 
> zlib-1.2.7-19.el7_9.x86_64
> (gdb) bt
> #0  0x000000000047262e in sl_free (s=0x783170, oo=0x7ffff6e1fc60) at 
> /tn-build/src/third_party/bird/lib/slab.c:315
> #1  0x0000000000462e33 in fib_delete (f=0x75f508, E=0x7ffff6e1fc60) at 
> /tn-build/src/third_party/bird/nest/rt-fib.c:479
> #2  0x0000000000420915 in rt_sync (p=0x75f350) at 
> /tn-build/src/third_party/bird/proto/ospf/rt.c:2104
> #3  0x00000000004218da in ospf_rt_spf.part.31 (p=0x75f350) at 
> /tn-build/src/third_party/bird/proto/ospf/rt.c:1721
> #4  0x000000000041f403 in ospf_rt_spf (p=0x75f350) at 
> /tn-build/src/third_party/bird/proto/ospf/rt.c:1692
> #5  0x000000000042fb8f in ospf_disp (timer=0x782f10) at 
> /tn-build/src/third_party/bird/proto/ospf/ospf.c:482
> #6  0x00000000004736cf in timers_fire (loop=0x74e0e0 <main_timeloop>) at 
> /tn-build/src/third_party/bird/lib/timer.c:235
> #7  0x0000000000408d0a in io_loop () at 
> /tn-build/src/third_party/bird/sysdep/unix/io.c:2260
> #8  0x0000000000405af6 in main (argc=2, argv=0x7fffffffeb68) at 
> /tn-build/src/third_party/bird/sysdep/unix/main.c:952
> 
> Since all routes are newly passed and OSPF SPF calculation is done from 
> scratch, not sure why delete / free has an issue.
> 
> Also issue is not seen when there are less routes say ~200. Seems to be 
> specific to huge number.
> If any pointers to this will be helpful
> 
> Thanks in advance .
> 
> Regards
> Mukund
> 
> 
> 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 2839 bytes
Desc: S/MIME Cryptographic Signature
URL: <http://trubka.network.cz/pipermail/bird-users/attachments/20220923/e6fd0257/attachment.p7s>


More information about the Bird-users mailing list