TI中文支持网
TI专业的中文技术问题搜集分享网站

AM5728 rcu_preempt 导致重启

AM5728 sdk5.0.2

打印如下log后板子重启,

[17:41:12] [ 57.232432] INFO: rcu_preempt detected stalls on CPUs/tasks:
[17:41:12] [ 57.238373] 1-…: (0 ticks this GP) idle=14a/140000000000000/0 softirq=15407/15407 fqs=0[17:41:12] [ 57.247117] (detected by 0, t=2102 jiffies, g=1197, c=1196, q=41)
[17:41:12] [ 57.253594] Sending NMI from CPU 0 to CPUs 1:
[17:41:22] [ 67.257281] rcu_preempt kthread starved for 2102 jiffies! g1197 c1196 f0x0 RCU_GP_WAIT_FQS(3) ->state=0x402 ->cpu=1
[17:41:22] [ 67.268211] rcu_preempt I 0 8 2 0x00000000
[17:41:22] [ 67.273961] Backtrace:[17:41:22] [ 67.276531] [<c0976768>] (__schedule) from [<c0976f60>] (schedule+0x58/0xc4)
[17:41:22] [ 67.283914] r10:c10181de r9:c10181d0 r8:c1002d00 r7:ee89bed8 r6:eed43580 r5:eed43580
[17:41:22] [ 67.292115] r4:ffffe000
[17:41:22] [ 67.294767] [<c0976f08>] (schedule) from [<c097a7b4>] (schedule_timeout+0x158/0x258)
[17:41:22] [ 67.302877] r5:eed43580 r4:ffff98f7
[17:41:22] [ 67.306625] [<c097a65c>] (schedule_timeout) from [<c028f988>] (rcu_gp_kthread+0x5a0/0x980)
[17:41:22] [ 67.315283] r8:00000001 r7:c10181dc r6:c1002d00 r5:00000001 r4:c1018080
[17:41:22] [ 67.322304] [<c028f3e8>] (rcu_gp_kthread) from [<c0248e28>] (kthread+0x164/0x16c)
[17:41:22] [ 67.330137] r7:ee89a000
[17:41:22] [ 67.332791] [<c0248cc4>] (kthread) from [<c0207d18>] (ret_from_fork+0x14/0x3c)
[17:41:22] [ 67.340358] r10:00000000 r9:00000000 r8:00000000 r7:00000000 r6:00000000 r5:c0248cc4
[17:41:22] [ 67.348556] r4:ee84ae40
[17:41:22] [ 67.351207] INFO: rcu_sched detected stalls on CPUs/tasks:
[17:41:22] [ 67.356952] 1-…: (1 GPs behind) idle=14a/140000000000000/0 softirq=15406/15407 fqs=34[17:41:22] [ 67.365515] (detected by 0, t=2102 jiffies, g=911, c=910, q=7)
[17:41:22] [ 67.371720] Sending NMI from CPU 0 to CPUs 1:
[17:41:32] [ 77.375403] rcu_sched kthread starved for 2034 jiffies! g911 c910 f0x0 RCU_GP_WAIT_FQS(3) ->state=0x0 ->cpu=0
[17:41:32] [ 77.385787] rcu_sched R running task 0 9 2 0x00000000
[17:41:32] [ 77.393172] Backtrace:[17:41:32] [ 77.395736] [<c0976768>] (__schedule) from [<c0976f60>] (schedule+0x58/0xc4)
[17:41:32] [ 77.403120] r10:c101841e r9:c1018410 r8:c1002d00 r7:ee89ded8 r6:eed34580 r5:eed34580
[17:41:32] [ 77.411319] r4:ffffe000
[17:41:32] [ 77.413970] [<c0976f08>] (schedule) from [<c097a7b4>] (schedule_timeout+0x158/0x258)
[17:41:32] [ 77.422078] r5:eed34580 r4:ffff993b
[17:41:32] [ 77.425822] [<c097a65c>] (schedule_timeout) from [<c028f988>] (rcu_gp_kthread+0x5a0/0x980)
[17:41:32] [ 77.434482] r8:00000000 r7:c101841c r6:c1002d00 r5:00000001 r4:c10182c0
[17:41:32] [ 77.441502] [<c028f3e8>] (rcu_gp_kthread) from [<c0248e28>] (kthread+0x164/0x16c)
[17:41:32] [ 77.449340] r7:ee89c000
[17:41:32] [ 77.451991] [<c0248cc4>] (kthread) from [<c0207d18>] (ret_from_fork+0x14/0x3c)
[17:41:32] [ 77.459555] r10:00000000 r9:00000000 r8:00000000 r7:00000000 r6:00000000 r5:c0248cc4
[17:41:32] [ 77.467757] r4:ee84ae40

user5875077:

自己顶

Nancy Wang:

回复 user5875077:

您是在执行了什么操作之后导致板子重启的?麻烦详细说一下。

user5875077:

回复 Nancy Wang:

您好,执行了一个摄像头采图+图像识别的算法的程序,CPU利用率接近80%。执行top发现,采图进程A cpu占比40%左右,算法进程B cpu占比40%左右。

Nancy Wang:

回复 user5875077:

您是用的TI提供的开发板还是自己的板子?如果用的是自己的板子建议您在TI的开发板上试一下。
您是用的RT-linux吗?能都用non-RT linux是一下是什么结果?
software-dl.ti.com/…/index_FDS.html
software-dl.ti.com/…/index_FDS.html

user5875077:

回复 Nancy Wang:

核心板是TI 的开发板,用的是NON-RT LINUx

user5875077:

回复 Nancy Wang:

出现死机的程序使用了TBB机制加速,没有tbb加速的程序则不会死机,请知悉

Nancy Wang:

回复 user5875077:

根据内核log打印信息来看,某个应用程序长时间加载CPU,导致内核CPU stall,启动了内核CPU stall检测器。
建议您阅读./<Linux_dir>/Documentation/RCU/stallwarn.txt中的指令,通过最小化cpu stall的时间来优化内核的配置。
这两个帖子也可以参考一下。
e2e.ti.com/…/581910
e2e.ti.com/…/694048

赞(0)
未经允许不得转载:TI中文支持网 » AM5728 rcu_preempt 导致重启
分享到: 更多 (0)