
[RESOLVED] 5.7 kernels lock up when suspending or hibernating.
stevenpusser
Status: Contributor
Joined: 14 Jan 2017
Posts: 89
This is on the Debian Buster-based MX 19 distro, 64-bit.

Tested on two different Intel-based laptops. The screen goes dark, but everything else stays powered on indefinitely, even the keyboard backlight, and I have to hold down the power button to restart. As an experiment, I rebuilt the 5.7-2 kernel with CONFIG_CHECKPOINT_RESTORE disabled, but it made no difference.

Any ideas?

< Edited by stevenpusser :: Jun 26, 20, 17:44 >

damentz
Status: Assistant
Joined: 09 Sep 2008
Posts: 1122
It might be ACPI_TINY_POWER_BUTTON, which I had added as a module. Arch's upstream configuration leaves it disabled.

Since you're able to change the configuration and rebuild, do you mind testing with the changes I added to the 5.7 branch of liquorix-package? github.com/damentz/liquorix-package/commits/5.7

:: Code ::
4a6e8e2 config: Disable CONFIG_X86_X32, no one uses it
6b50596 config: Backport Arch Linux ACPI_*=y static configuration
84ea447 config: Disable "CONFIG_ACPI_TINY_POWER_BUTTON"


I'll give these changes a test later today once I have more time outside of work.
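
For anyone rebuilding along with these commits, here is a minimal Python sketch for checking how an option ended up in a kernel config file. The helper name `config_state` and the sample text are made up for illustration. On Debian-based systems the config of an installed kernel is normally found at /boot/config-<kernel release>, but treat that path as an assumption for your own setup.

```python
import re

def config_state(config_text, option):
    """Return the value of a kernel config option.

    'y' or 'm' (or another literal value) if it is set,
    'n' if the file says '# OPTION is not set',
    None if the option does not appear at all.
    """
    for line in config_text.splitlines():
        line = line.strip()
        if line == f"# {option} is not set":
            return "n"
        match = re.fullmatch(rf"{re.escape(option)}=(.*)", line)
        if match:
            return match.group(1)
    return None

# Hypothetical excerpt, NOT taken from a real Liquorix config.
sample = """\
CONFIG_ACPI_BUTTON=y
CONFIG_ACPI_TINY_POWER_BUTTON=m
# CONFIG_X86_X32 is not set
"""

for opt in ("CONFIG_ACPI_TINY_POWER_BUTTON", "CONFIG_X86_X32"):
    print(opt, "->", config_state(sample, opt))
```

To check a real kernel, read the config file into a string and pass it in place of `sample`.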
stevenpusser
Status: Contributor
Joined: 14 Jan 2017
Posts: 89
OK, I'll give it a try.
damentz
Status: Assistant
Joined: 09 Sep 2008
Posts: 1122
Well, I finally got a chance to try it. The kernel causes suspend issues on my work laptop too.

I'll keep looking for what might be causing it. Hopefully we get more reports and can nail down what's similar among them.
stevenpusser
Status: Contributor
Joined: 14 Jan 2017
Posts: 89
Sad to say, not building that TINY module didn't make any difference.

The two test machines are a 2016 Acer laptop (Intel Skylake) and a 2018 MSI laptop (Intel Coffee Lake).
damentz
Status: Assistant
Joined: 09 Sep 2008
Posts: 1122
Yup, definitely not the ACPI module. I managed to determine that it's probably related to CPU hotplug support: something changed in MuQSS for v5.7 that broke it.

Here's the oops I get on my own laptop. I trigger it by running smt-manager (linked in the forums) to disable the logical SMT cores; running the script produces the bug below:

:: Code ::
[  264.560048] ------------[ cut here ]------------
[  264.560049] kernel BUG at kernel/fork.c:698!
[  264.560058] invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
[  264.560062] CPU: 7 PID: 96649 Comm: perl Tainted: G     U     O      5.7.6-lqx1-1-lqx #1
[  264.560063] Hardware name: LENOVO 20QD001VUS/20QD001VUS, BIOS N2HET50W (1.33 ) 05/13/2020
[  264.560069] RIP: 0010:__mmdrop+0xe6/0x110
[  264.560071] Code: 00 00 00 74 10 48 8b 3d 48 1c 7a 01 48 89 ee 5d e9 df 79 1e 00 e8 2a f7 0b 00 48 8b 3d 33 1c 7a 01 48 89 ee 5d e9 ca 79 1e 00 <0f> 0b 0f 0b e9 4e ff ff ff 48 89 ef e8 b9 43 1e 00 e9 63 ff ff ff
[  264.560072] RSP: 0018:ffffc90015b87d40 EFLAGS: 00010246
[  264.560074] RAX: ffff8881074c3800 RBX: 0000000000000005 RCX: ffffc90015b87c3f
[  264.560075] RDX: 00000000fffffffb RSI: 0000000000000035 RDI: ffffffff8226a6a0
[  264.560076] RBP: 0000000000000056 R08: ffff8881074c3d97 R09: 0000000000000000
[  264.560077] R10: ffffffffffffffff R11: ffff8881074c3d97 R12: 0000000000000000
[  264.560078] R13: 00000000000000d2 R14: ffffffff81098280 R15: ffff88846e559560
[  264.560080] FS:  00007f9c30dd5740(0000) GS:ffff88846e5c0000(0000) knlGS:0000000000000000
[  264.560081] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  264.560082] CR2: 000055c0d82f20d0 CR3: 00000003b1aa4002 CR4: 00000000003606e0
[  264.560083] Call Trace:
[  264.560088]  finish_cpu+0x2e/0x40
[  264.560092]  cpuhp_invoke_callback.constprop.0+0x75/0x470
[  264.560096]  ? cpuhp_kick_ap_work+0x6d/0x1e0
[  264.560099]  _cpu_down+0xac/0x200
[  264.560103]  cpu_device_down+0x2b/0x50
[  264.560106]  device_offline+0x7b/0xa0
[  264.560108]  online_store+0x9f/0x140
[  264.560112]  kernfs_fop_write+0xc9/0x1a0
[  264.560116]  vfs_write+0xf4/0x410
[  264.560119]  ? __x64_sys_newfstat+0x7b/0xa0
[  264.560122]  __x64_sys_write+0x50/0xc0
[  264.560125]  do_syscall_64+0x49/0x130
[  264.560129]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  264.560133] RIP: 0033:0x7f9c31071b57
[  264.560136] Code: 0c 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 48 89 54 24 18 48 89 74 24
[  264.560137] RSP: 002b:00007ffd7d9edfd8 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  264.560140] RAX: ffffffffffffffda RBX: 000055e6a5cf5960 RCX: 00007f9c31071b57
[  264.560141] RDX: 0000000000000001 RSI: 000055e6a5cf5960 RDI: 0000000000000003
[  264.560142] RBP: 0000000000000001 R08: 0000000000000000 R09: 0000000000002000
[  264.560144] R10: 0000000000000004 R11: 0000000000000246 R12: 000055e6a5cf4720
[  264.560146] R13: 000055e6a56c92a0 R14: 0000000000000003 R15: 000055e6a5cf4720
[  264.560150] Modules linked in: fuse rfcomm ccm cmac algif_hash algif_skcipher af_alg af_packet msr bnep vmnet(O) nls_utf8 nls_cp437 vfat fat btusb btrtl btbcm btintel uvcvideo bluetooth ecdh_generic ecc videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev mc snd_soc_skl_hda_dsp joydev snd_hda_codec_hdmi snd_soc_hdac_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal snd_soc_dmic intel_powerclamp coretemp snd_sof_pci snd_sof_intel_byt snd_sof_intel_ipc snd_sof_intel_hda_common kvm_intel hid_multitouch snd_soc_hdac_hda snd_sof_xtensa_dsp snd_sof_intel_hda snd_sof hid_generic snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi kvm snd_intel_dspcfg irqbypass snd_hda_codec crct10dif_pclmul i2c_designware_platform crc32_pclmul iTCO_wdt iTCO_vendor_support i2c_designware_core intel_rapl_msr ghash_clmulni_intel snd_hda_core aesni_intel intel_wmi_thunderbolt wmi_bmof crypto_simd snd_hwdep iwlmvm cryptd snd_soc_core glue_helper snd_compress mac80211
[  264.560192]  libarc4 ac97_bus rtc_cmos snd_pcm_dmaengine e1000e snd_pcm snd_timer mei_hdcp iwlwifi input_leds i2c_i801 thunderbolt psmouse processor_thermal_device intel_lpss_pci intel_lpss intel_rapl_common cfg80211 intel_soc_dts_iosf idma64 intel_pch_thermal thinkpad_acpi nvram i2c_hid ledtrig_audio rfkill hid ucsi_acpi wmi typec_ucsi typec snd soundcore int3403_thermal int340x_thermal_zone battery ac pinctrl_cannonlake pinctrl_intel int3400_thermal acpi_thermal_rel evdev acpi_cpufreq acpi_tad nf_log_ipv6 ip6t_REJECT nf_reject_ipv6 xt_hl ip6t_rt nf_log_ipv4 nf_log_common ipt_REJECT nf_reject_ipv4 xt_LOG xt_multiport xt_comment xt_limit xt_addrtype xt_tcpudp xt_conntrack ip6table_filter ip6_tables nf_conntrack_netbios_ns nf_conntrack_broadcast nf_nat_ftp nf_nat nf_conntrack_ftp nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter bpfilter zram vmmon(O) vmw_vmci vboxnetflt(O) vboxnetadp(O) vboxdrv(O) usbip_host usbip_core sg scsi_mod binfmt_misc crypto_user ip_tables x_tables autofs4
[  264.560238]  serio_raw xhci_pci xhci_hcd xfs libcrc32c i915 intel_gtt i2c_algo_bit drm_kms_helper cec rc_core drm i2c_core ext4 crc32c_generic crc32c_intel crc16 mbcache jbd2
[  264.560253] ---[ end trace 70db5bccbcfcbff0 ]---
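
The online_store and cpu_device_down frames in the trace correspond to a write to a CPU's sysfs "online" file, which is the same hotplug path suspend uses when it takes the non-boot CPUs offline. Below is a rough Python sketch of that kind of reproduction. It is not how smt-manager is implemented; the function names are hypothetical, and only the sysfs paths (/sys/devices/system/cpu/cpuN/online and topology/thread_siblings_list) are standard kernel interfaces. Writing to "online" needs root and may hang an affected kernel.

```python
from pathlib import Path

CPU_ROOT = Path("/sys/devices/system/cpu")

def parse_cpu_list(text):
    """Parse a sysfs CPU list such as '0-3,8' into a list of ints."""
    cpus = []
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            lo, hi = part.split("-")
            cpus.extend(range(int(lo), int(hi) + 1))
        else:
            cpus.append(int(part))
    return cpus

def secondary_siblings(sibling_lists):
    """Return every CPU that is not the lowest-numbered thread of its core.

    sibling_lists holds the contents of each CPU's thread_siblings_list.
    """
    secondary = set()
    for text in sibling_lists:
        secondary.update(sorted(parse_cpu_list(text))[1:])
    return sorted(secondary)

def plan_offline(root=CPU_ROOT):
    """Read the topology from sysfs and list the SMT threads to offline."""
    files = sorted(root.glob("cpu[0-9]*/topology/thread_siblings_list"))
    return secondary_siblings(f.read_text() for f in files)

def set_online(cpu, online, root=CPU_ROOT):
    """Write '1' or '0' to cpuN/online. Requires root on a real system."""
    (root / f"cpu{cpu}" / "online").write_text("1" if online else "0")
```

Calling `plan_offline()` only reads sysfs and returns the candidate CPUs; nothing is taken offline until `set_online(cpu, False)` is called for each of them.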


At this point I have a few options:

1) Work with Con (more likely, wait until the next kernel resync) to figure out what broke and fix hotplug support
2) Try resyncing MuQSS from v5.6 myself (this will need to wait until the weekend unless I get more time than expected this week)
3) Revert Liquorix entirely to v5.6 until the bug is fixed or v5.8 comes out (if everything else fails)
stevenpusser
Status: Contributor
Joined: 14 Jan 2017
Posts: 89
Thanks for the update!
damentz
Status: Assistant
Joined: 09 Sep 2008
Posts: 1122
Hey stevenpusser,

I believe I fixed the issue. Can you try the latest Liquorix kernel? My personal work laptop now suspends where it didn't before.
stevenpusser
Status: Contributor
Joined: 14 Jan 2017
Posts: 89
OK, will build and get back to you later.
stevenpusser
Status: Contributor
Joined: 14 Jan 2017
Posts: 89
That seems to have fixed the problem; marking this thread [RESOLVED]. Thanks!
All times are GMT - 8 Hours