Starter core dump on rhel9
Hi,
we have been testing Tango setup on rhel9 machine. What we have noticed is that after fresh installation everything works fine. Then when starting any DeviceServer (e.g TangoTest) either via Starter or with commandline, the Starter crashes every time with core dump:
[admin@ml ~]$ sudo systemctl status tango-starter.service
× tango-starter.service - Tango starter
Loaded: loaded (/usr/lib/systemd/system/tango-starter.service; disabled; preset: disabled)
Active: failed (Result: core-dump) since Tue 2023-07-11 14:16:27 CEST; 1s ago
Duration: 4.420s
Process: 31383 ExecStart=/usr/bin/tango-starter start (code=exited, status=0/SUCCESS)
Main PID: 31425 (code=dumped, signal=ABRT)
CPU: 266ms
Jul 11 14:16:20 ml.machine systemd[1]: Starting Tango starter...
Jul 11 14:16:20 ml.machine runuser[31423]: pam_unix(runuser:session): session opened for user tango(uid=990) by (uid=0)
Jul 11 14:16:20 ml.machine runuser[31423]: pam_unix(runuser:session): session closed for user tango
Jul 11 14:16:22 ml.machine tango-starter[31383]: Starting Tango starter daemon[ OK ]
Jul 11 14:16:22 ml.machine systemd[1]: Started Tango starter.
Jul 11 14:16:27 ml.machine systemd-coredump[31442]: [🡕] Process 31425 (Starter) of user 990 dumped core.
Jul 11 14:16:27 ml.machine systemd[1]: tango-starter.service: Main process exited, code=dumped, status=6/ABRT
Jul 11 14:16:27 ml.machine systemd[1]: tango-starter.service: Failed with result 'core-dump'.
[admin@ml ~]$ sudo journalctl -xeu tango-starter.service
░░
░░ The job identifier is 42049.
Jul 11 14:16:27 ml.machine systemd-coredump[31442]: [🡕] Process 31425 (Starter) of user 990 dumped core.
░░ Subject: Process 31425 (Starter) dumped core
░░ Defined-By: systemd
░░ Support: https://access.redhat.com/support
░░ Documentation: man:core(5)
░░
░░ Process 31425 (Starter) crashed and dumped core.
░░
░░ This usually indicates a programming error in the crashing program and
░░ should be reported to its vendor as a bug.
Jul 11 14:16:27 ml.machine systemd[1]: tango-starter.service: Main process exited, code=dumped, status=6/ABRT
░░ Subject: Unit process exited
░░ Defined-By: systemd
░░ Support: https://access.redhat.com/support
░░
░░ An ExecStart= process belonging to unit tango-starter.service has exited.
░░
░░ The process' exit code is 'dumped' and its exit status is 6.
Jul 11 14:16:27 ml.machine systemd[1]: tango-starter.service: Failed with result 'core-dump'.
░░ Subject: Unit failed
░░ Defined-By: systemd
░░ Support: https://access.redhat.com/support
░░
░░ The unit tango-starter.service has entered the 'failed' state with result 'core-dump'.
[admin@ml ~]$ Starter ml
/usr/include/c++/11/bits/stl_vector.h:1045: std::vector<_Tp, _Alloc>::reference std::vector<_Tp, _Alloc>::operator[](std::vector<_Tp, _Alloc>::size_type) [with _Tp = Starter_ns::ControlledServer; _Alloc = std::allocator<Starter_ns::ControlledServer>; std::vector<_Tp, _Alloc>::reference = Starter_ns::ControlledServer&; std::vector<_Tp, _Alloc>::size_type = long unsigned int]: Assertion '__n < this->size()' failed.
Aborted (core dumped)
Starting e.g TangoTest via commandline works fine.
Setup:
- VM rhel9 with libtango 9.3.6 (TSD), omniORB 4.3.0, zeromq 4.3,4 + cppTango!1092 (4c237e57) patch
- Tango Host on rhel7 VM with libtango 9.3.5 (TSD), omniORB 4.2.2, zeromq 4.2.2
We can perform more tests if necessary.
Cheers
Michal, SOLARIS