Fedora 16 kernel bugzilla status report from 2012-02-10 to 2012-02-17

494 bugs open in total (up from 378 last week).
66 bugs closed.


Interesting closures:

Another couple dozen wireless driver dupes. The most commonly reported bug is still 768639: WARNING: at /builddir/build/BUILD/kernel-3.1.fc17/compat-wireless-2011-12-01/drivers/net/wireless/ath/ath9k/rc.c:697 ath_rc_get_highest_rix+0x158/0x1f0 [ath9k]()

Second most common source of dupe bugs this week was the i915 driver, which grew several new problems.
772886: WARNING: at drivers/gpu/drm/i915/intel_display.c:953 intel_disable_pipe+0x120/0x150 [i915]()
790701: WARNING: at drivers/gpu/drm/i915/intel_dp.c:344 intel_dp_check_edp+0x65/0xb0 [i915]()
790702: WARNING: at drivers/gpu/drm/i915/intel_dp.c:1006 ironlake_edp_panel_vdd_on+0x197/0x1a0 [i915]()
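Spotting dupes like these mostly comes down to comparing the file, line and function named in the WARNING line. A minimal sketch of pulling those fields out of a bug title follows. It assumes the titles all have the shape seen above; the regex and the helper name `warning_signature` are illustrative and not part of any bugzilla or abrt tooling.

```python
import re

# Assumed shape of a WARNING title, based on the ones quoted in this report:
#   WARNING: at <path>:<line> <function>+0x<offset>/0x<length> [<module>]()
# The "[<module>]" part is absent for warnings from built-in code.
WARNING_RE = re.compile(
    r"WARNING: at (?P<path>\S+):(?P<line>\d+) "
    r"(?P<func>\w+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<module>\w+)\])?\(\)"
)


def warning_signature(title):
    """Return (path, line, function, module) for a WARNING title, or None.

    Hypothetical helper: module is None when the title has no [module] tag.
    """
    match = WARNING_RE.search(title)
    if match is None:
        return None
    return (
        match.group("path"),
        int(match.group("line")),
        match.group("func"),
        match.group("module"),
    )


if __name__ == "__main__":
    title = (
        "772886: WARNING: at drivers/gpu/drm/i915/intel_display.c:953 "
        "intel_disable_pipe+0x120/0x150 [i915]()"
    )
    print(warning_signature(title))
```

Paths and line numbers shift between kernel builds, so when comparing reports from different versions the function and module are probably the more stable part of the signature.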

739499: kernel-3.1.0-0.rc6.git0.3.fc16.x86_64 won’t boot on EC2
Fix picked up in the 3.2 rebase.

We saw a few more dupes of the sysfs link remove warning.
The fix is already queued up for the next update.

In last week’s f16 report, under ‘totally weird shit’, I pointed at bug 787527: kernel BUG at mm/mmap.c:2378!. At the time I had no idea what was happening. Over the week, we got several more reports. Hugh Dickins chased this down to a locking bug in the transparent huge pages code. (upstream thread).
We’ll pull in the final fix for that in the next update.

We’ve had a number of bugs reported from the soft lockup detector firing. When this happens, the traces often don’t make much sense.
The common factor seems to be that they are all running under some form of virtualisation. Here’s one from VMware, for example (our first f17 kernel bug is the same problem, but in qemu). For now, booting guests with nosoftlockup is probably the best we can do. There is some work ongoing upstream to better handle this situation.
TODO: Go through all the rest of the soft lockup bugs and see if any of them are the same problem. (likely).
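When triaging these reports, it helps to know whether a guest was actually booted with nosoftlockup. A minimal sketch that checks a kernel command line for the flag follows. The helper names are made up for illustration; the only things assumed about the system are that /proc/cmdline holds the boot command line and that parameters are whitespace-separated.

```python
def has_boot_flag(cmdline, flag="nosoftlockup"):
    """Return True if flag appears as a standalone token on the command line."""
    # Splitting on whitespace avoids matching the flag inside another
    # parameter's name or value.
    return flag in cmdline.split()


def read_cmdline(path="/proc/cmdline"):
    """Read the running kernel's command line (Linux only)."""
    with open(path) as f:
        return f.read().strip()


if __name__ == "__main__":
    if has_boot_flag(read_cmdline()):
        print("soft lockup detector disabled at boot")
    else:
        print("nosoftlockup not set")
```

The flag itself is added to the guest's kernel line in its bootloader configuration; the check above only reports what the running kernel was given.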

772649: Frequency not scaling on demand – Sandy Bridge
We’ve seen all kinds of power management disasters on Sandy Bridge systems,
from the ongoing i915 rc6 fiasco to BIOS bonghits that take away P-states when things get too hot.
I suspect it’s all related.

790097: your kernel is tainted by flags
abrt was automatically filing so many bug reports from tainted kernels, which we don’t care about, that we had the abrt guys put in a dialog explaining to users that it wasn’t going to file them.
So naturally, users have started filing them by hand. Derp.

Just like in F15, we got a bunch more reports of the sd_revalidate_disk bug.
The fix for it is going to be in next week’s update.


91 still-open bugs got filed or changed in some way. Of those, here are some of the more interesting ones.

428555: Soft lockup while doing load_policy
A very old SELinux bug, where loading the policy takes a really long time.
A cond_resched would silence the soft lockup detector, but I’m really curious why it’s taking 22 seconds to load a policy.
Something really doesn’t add up here. AFAIK, this is the only report we’ve ever had of a policy load taking this long.

593035: mount.nfs: page allocation failure. order:4, mode:0xc0d0
The new NFS idmapper code should fix this problem, but is only just getting tested in f17.
Once it’s proven itself there, we’ll look at backporting whatever is necessary to f16 (f15 is likely to be EOL by that point).
This will require userspace updates, which is another reason it won’t be happening in f15.

There were a bunch more irqpoll bugs reported. Still no resolution on the automatic fallback-to-polling idea upstream.

assorted wireless:
746744: Can not connect to PEAP using Intel Corporation WiFi Link 5100
755370: ath9k stability issues
767855: Wifi performance issues (Tx aggregation enabled on ra=MAC)
768639: WARNING: at /builddir/build/BUILD/kernel-3.1.fc17/compat-wireless-2011-12-01/drivers/net/wireless/ath/ath9k/rc.c:697 ath_rc_get_highest_rix+0x158/0x1f0 [ath9k]()
770484: WARNING: at /builddir/build/BUILD/kernel-3.1.fc16/compat-wireless-3.2-rc6-3/drivers/net/wireless/iwlwifi/iwl-trans-pcie-tx.c:739 iwl_enqueue_hcmd+0x5c8/0x5f0 [iwlwifi]()
770595: WARNING: at /builddir/build/BUILD/kernel-3.1.fc16/compat-wireless-3.2-rc6-3/drivers/net/wireless/iwlwifi/iwl-trans-pcie-rx.c:461 iwl_irq_tasklet+0x3bd/0x7c0 [iwlwifi]()
773513: Wifi wireless network connection abruptly stop working
773652: [ath9k] randomly disconnects wireless[AR9285] — lenovo g475
785422: Wireless fails after kernel update to kernel 3.2.2.1 pae
785561: 3.2.5-3.fc16.x86_64/X53S/K53SV iwlwifi runs like a sloth compared to ath9k
785913: WARNING: at /builddir/build/BUILD/kernel-3.2.fc16/compat-wireless-3.3-rc1-2/drivers/net/wireless/iwlwifi/iwl-agn-tx.c:396 iwlagn_tx_skb+0x98d/0xa10 [iwlwifi]()
786609: WARNING: at /builddir/build/BUILD/kernel-3.2.fc16/compat-wireless-3.3-rc1-2/include/net/mac80211.h:3618 rate_control_send_low+0x23e/0x250 [mac80211]()
787649: WARNING: at /builddir/build/BUILD/kernel-3.2.fc16/compat-wireless-3.3-rc1-2/drivers/net/wireless/brcm80211/brcmsmac/main.c:7998 brcms_c_wait_for_tx_completion+0x99/0xb0 [brcmsmac]()
788012: WARNING: at /builddir/build/BUILD/kernel-3.2.fc16/compat-wireless-3.3-rc1-2/net/mac80211/driver-ops.h:10 ieee80211_bss_info_change_notify+0x28a/0x290 [mac80211]()
789605: rtl8192cu: After 5~6 minutes, wireless usb lancard doesn’t work (cannot connect internet).
789159: network connection failure
790810: ath5k port gets “hard blocked” when Wireless is disabled via NetworkManager
794710: ath9k: Cannot enable WiFi in gnome-shell (toggle button switches back to Off state)
790275: BUG: unable to handle kernel NULL pointer dereference at 0000000000000060 rtl92ce_get_desc()

ethernet:
625776: e1000e crashes with Intel 82574L
720207: Realtek rtl8188ce works slow: speed is around 1Mb/s
794788: Wake-On-LAN stopped working after upgrade from FC15 to FC16
781217: crash after unplugging DSL cable (atl1c?)

suspend/hibernate:
783032: Can’t suspend to RAM when IR dongle is allowed to wake
788433: Core i7 cannot pm-hibernate/pm-suspend/thaw properly
789699: suspend fails by instant resume
791149: System can’t be suspended with kernel 3.2.x
791267: System reboots immediately after hibernating with 3.2 kernels
794525: 3.2.6-3.fc16.x86_64 doesnt suspend
789708: Hibernating fails all time
767084: kernel crash after back from sleep

boot failures:
789536: Can’t boot into new kernel
789679: kernel 3.2.3-2, 3.2.5-3 won’t boot encrypted setup
791133: Fedora 16 doesn’t boot with 3.2.6-3.fc16.x86_64 kernel on my notebook

Misc oopses/warn_on’s/scary shit:
721127: Heavy disk I/O (MD RAID?) crashes or freezes Fedora 15
787862: WARNING: at fs/sysfs/inode.c:323 sysfs_hash_and_remove+0xa9/0xb0()
788706: WARNING: at block/genhd.c:1568 disk_clear_events+0x106/0x110()
791277: WARNING: at kernel/watchdog.c:241 watchdog_overflow_callback+0x9b/0xa6()
794692: WARNING: at fs/dcache.c:2485 prepend_path+0x18c/0x1a0()
789990: BUG: unable to handle kernel paging request at 000000011b02e000 vmap_page_range_noflush()
790013: BUG: unable to handle kernel NULL pointer dereference at 0000000000000008 path_lookupat()
794531: BUG: unable to handle kernel NULL pointer dereference at (null) sock_init_data()
769576: WARNING: at lib/list_debug.c:56 __list_del_entry+0x82/0xd0()
770581: WARNING: at kernel/softirq.c:159 local_bh_enable_ip+0x7a/0xa0()
771794: kernel: general protection fault: 0000 [#1] SMP
772738: system crash
755334: Kernel freezes, loops audio, under moderate cpu load
788981: map_vm_area() BUG: unable to handle kernel paging request at 000000011b02e000
789993: BUG: Bad page state in process tar pfn:27d36
794639: BUG: Bad page map in process firefox pte:02126065 pmd:19f00067

btrfs:
789632: WARNING: at fs/btrfs/extent-tree.c:5985 btrfs_alloc_free_block+0x354/0x360 [btrfs]()
790297: kernel BUG at fs/btrfs/transaction.c:1337!
790232: untarring incredibly slow over NFS even worse with BTRFS export

Brightness changing seems broken:
702352: Brightness adjustment FN keys doesn’t work
784532: kernel-3.2.1-3.fc16.x86_64 seems to break settings-screen- brightness slider control so disappears
788675: acpi_video_device_lcd_get_level_current() BUG: unable to handle kernel NULL pointer dereference at 0000000000000009
789962: Cannot adjust the brightness of the display in my laptop by pressing the function key