QEMU LoongArch regression after merging LASX changes
Host environment
- Operating system: Debian Linux Bookworm
- OS/kernel version: 6.2.16-12-pve
- Architecture: x86_64
- QEMU flavor: qemu-system-loongarch64
- QEMU version: v8.1.0-1000-g494a6a2c
- QEMU command line:
./qemu-system-loongarch64 \ -m 64G \ -cpu la464-loongarch-cpu \ -machine virt \ -smp 16 \ -bios ./QEMU_EFI_7.2.fd \ -nographic \ -device virtio-net,netdev=net0 \ -netdev user,id=net0,hostfwd=tcp::4444-:22 \ -hda gentoo.qcow2
Emulated/Virtualized environment
- Operating system: Gentoo Linux
- OS/kernel version: Linux 6.5
- Architecture: LoongArch
Description of problem
After enabling LASX in qemu (@gaosong), booting Gentoo Linux with latest glibc master (w/ LSX & LASX optimized libc routines) will fail in systemd:
[ 10.350207] Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000085
[ 10.350557] CPU: 5 PID: 1 Comm: systemd Not tainted 6.5.2-gentoo #2
[ 10.350655] Hardware name: QEMU QEMU Virtual Machine, BIOS 0.0.0 02/06/2015
[ 10.350961] Stack : 0072617764726148 0000000000000000 9000000000223440 90000001000e4000
[ 10.351181] 90000001000e7990 90000001000e7998 0000000000000000 90000001000e7ad8
[ 10.351294] 90000001000e7ad0 90000001000e7ad0 90000001000e7900 0000000000000001
[ 10.351406] 0000000000000001 90000001000e7998 ec94a2e1446052e6 9000000100438140
[ 10.351519] 0000000000000001 0000000000000003 0000000000000000 0000000000000030
[ 10.351630] 0000000000000000 00000000000559bf 00000000056e0000 0000000000000004
[ 10.351745] 0000000000000000 0000000000000000 900000000162b438 900000000177e000
[ 10.351856] 00000000400004d8 0000000000000001 0000000000000018 90000001000e7c84
[ 10.351968] 0000000000020000 0000000000000000 9000000000223458 00007ffff0341af0
[ 10.352081] 00000000000000b0 0000000000000004 0000000000000000 0000000000071c1c
[ 10.352196] ...
[ 10.352277] Call Trace:
[ 10.352482] [<9000000000223458>] show_stack+0x5c/0x180
[ 10.353518] [<9000000001178d4c>] dump_stack_lvl+0x60/0x88
[ 10.353592] [<900000000115cd7c>] panic+0x13c/0x308
[ 10.353670] [<900000000024244c>] do_exit+0x860/0x868
[ 10.353735] [<900000000024261c>] do_group_exit+0x34/0x94
[ 10.353803] [<9000000000250514>] get_signal+0x75c/0x804
[ 10.353869] [<90000000002254c4>] arch_do_signal_or_restart+0x74/0xae0
[ 10.353944] [<90000000002c738c>] exit_to_user_mode_loop.isra.0+0x90/0x10c
[ 10.354041] [<9000000001179ff0>] irqentry_exit_to_user_mode+0x1c/0x28
[ 10.354119] [<90000000011792f8>] do_bp+0xcc/0x2ac
[ 10.354222] [<90000001005a1924>] 0x90000001005a1924
[ 10.354522] [<00007ffff0341af0>] 0x7ffff0341af0
Full log:
Instruction trace -d in_asm,out_asm,op
(very large):
I also tried to boot LoongArchLinux whose glibc does not have LSX/LASX optimized C routines, and it can boot without problems. If I chroot from LoongArchLinux into Gentoo Linux, running emerge
command will SIGSEGV.
If I disable LASX in CPUCFG2, the problem is gone:
// data = FIELD_DP32(data, CPUCFG2, LASX, 1),
I guess the bug is related to LASX assemblies in glibc.
Steps to reproduce
- Launch qemu
- Wait for systemd to be killed
- Collect logs