Changelog in Linux kernel 6.1.139

arm64: bpf: Add BHB mitigation to the epilogue for cBPF programs [+ + +]

Author: James Morse <james.morse@arm.com>
Date:   Thu Dec 9 15:13:24 2021 +0000

    arm64: bpf: Add BHB mitigation to the epilogue for cBPF programs
    
    commit 0dfefc2ea2f29ced2416017d7e5b1253a54c2735 upstream.
    
    A malicious BPF program may manipulate the branch history to influence
    what the hardware speculates will happen next.
    
    On exit from a BPF program, emit the BHB mititgation sequence.
    
    This is only applied for 'classic' cBPF programs that are loaded by
    seccomp.
    
    Signed-off-by: James Morse <james.morse@arm.com>
    Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
    Acked-by: Daniel Borkmann <daniel@iogearbox.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

arm64: bpf: Only mitigate cBPF programs loaded by unprivileged users [+ + +]

Author: James Morse <james.morse@arm.com>
Date:   Tue Apr 29 16:03:38 2025 +0100

    arm64: bpf: Only mitigate cBPF programs loaded by unprivileged users
    
    commit f300769ead032513a68e4a02e806393402e626f8 upstream.
    
    Support for eBPF programs loaded by unprivileged users is typically
    disabled. This means only cBPF programs need to be mitigated for BHB.
    
    In addition, only mitigate cBPF programs that were loaded by an
    unprivileged user. Privileged users can also load the same program
    via eBPF, making the mitigation pointless.
    
    Signed-off-by: James Morse <james.morse@arm.com>
    Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
    Acked-by: Daniel Borkmann <daniel@iogearbox.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

arm64: dts: imx8mm-verdin: Link reg_usdhc2_vqmmc to usdhc2 [+ + +]

Author: Wojciech Dubowik <Wojciech.Dubowik@mt.com>
Date:   Thu Apr 24 11:59:14 2025 +0200

    arm64: dts: imx8mm-verdin: Link reg_usdhc2_vqmmc to usdhc2
    
    commit 5591ce0069ddda97cdbbea596bed53e698f399c2 upstream.
    
    Define vqmmc regulator-gpio for usdhc2 with vin-supply
    coming from LDO5.
    
    Without this definition LDO5 will be powered down, disabling
    SD card after bootup. This has been introduced in commit
    f5aab0438ef1 ("regulator: pca9450: Fix enable register for LDO5").
    
    Fixes: 6a57f224f734 ("arm64: dts: freescale: add initial support for verdin imx8m mini")
    Fixes: f5aab0438ef1 ("regulator: pca9450: Fix enable register for LDO5")
    Tested-by: Manuel Traut <manuel.traut@mt.com>
    Reviewed-by: Philippe Schenker <philippe.schenker@impulsing.ch>
    Tested-by: Francesco Dolcini <francesco.dolcini@toradex.com>
    Reviewed-by: Francesco Dolcini <francesco.dolcini@toradex.com>
    Cc: stable@vger.kernel.org
    Signed-off-by: Wojciech Dubowik <Wojciech.Dubowik@mt.com>
    Signed-off-by: Shawn Guo <shawnguo@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

arm64: insn: Add support for encoding DSB [+ + +]

Author: James Morse <james.morse@arm.com>
Date:   Thu Dec 9 15:12:19 2021 +0000

    arm64: insn: Add support for encoding DSB
    
    commit 63de8abd97ddb9b758bd8f915ecbd18e1f1a87a0 upstream.
    
    To generate code in the eBPF epilogue that uses the DSB instruction,
    insn.c needs a heler to encode the type and domain.
    
    Re-use the crm encoding logic from the DMB instruction.
    
    Signed-off-by: James Morse <james.morse@arm.com>
    Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

arm64: proton-pack: Add new CPUs 'k' values for branch mitigation [+ + +]

Author: James Morse <james.morse@arm.com>
Date:   Mon Aug 12 17:50:22 2024 +0100

    arm64: proton-pack: Add new CPUs 'k' values for branch mitigation
    
    commit efe676a1a7554219eae0b0dcfe1e0cdcc9ef9aef upstream.
    
    Update the list of 'k' values for the branch mitigation from arm's
    website.
    
    Add the values for Cortex-X1C. The MIDR_EL1 value can be found here:
    https://developer.arm.com/documentation/101968/0002/Register-descriptions/AArch>
    
    Link: https://developer.arm.com/documentation/110280/2-0/?lang=en
    Signed-off-by: James Morse <james.morse@arm.com>
    Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

arm64: proton-pack: Expose whether the branchy loop k value [+ + +]

Author: James Morse <james.morse@arm.com>
Date:   Tue Apr 29 13:55:17 2025 +0100

    arm64: proton-pack: Expose whether the branchy loop k value
    
    commit a1152be30a043d2d4dcb1683415f328bf3c51978 upstream.
    
    Add a helper to expose the k value of the branchy loop. This is needed
    by the BPF JIT to generate the mitigation sequence in BPF programs.
    
    Signed-off-by: James Morse <james.morse@arm.com>
    Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

arm64: proton-pack: Expose whether the platform is mitigated by firmware [+ + +]

Author: James Morse <james.morse@arm.com>
Date:   Mon Aug 19 14:15:53 2024 +0100

    arm64: proton-pack: Expose whether the platform is mitigated by firmware
    
    commit e7956c92f396a44eeeb6eaf7a5b5e1ad24db6748 upstream.
    
    is_spectre_bhb_fw_affected() allows the caller to determine if the CPU
    is known to need a firmware mitigation. CPUs are either on the list
    of CPUs we know about, or firmware has been queried and reported that
    the platform is affected - and mitigated by firmware.
    
    This helper is not useful to determine if the platform is mitigated
    by firmware. A CPU could be on the know list, but the firmware may
    not be implemented. Its affected but not mitigated.
    
    spectre_bhb_enable_mitigation() handles this distinction by checking
    the firmware state before enabling the mitigation.
    
    Add a helper to expose this state. This will be used by the BPF JIT
    to determine if calling firmware for a mitigation is necessary and
    supported.
    
    Signed-off-by: James Morse <james.morse@arm.com>
    Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

bpf: Scrub packet on bpf_redirect_peer [+ + +]

Author: Paul Chaignon <paul.chaignon@gmail.com>
Date:   Mon May 5 21:58:04 2025 +0200

    bpf: Scrub packet on bpf_redirect_peer
    
    [ Upstream commit c4327229948879814229b46aa26a750718888503 ]
    
    When bpf_redirect_peer is used to redirect packets to a device in
    another network namespace, the skb isn't scrubbed. That can lead skb
    information from one namespace to be "misused" in another namespace.
    
    As one example, this is causing Cilium to drop traffic when using
    bpf_redirect_peer to redirect packets that just went through IPsec
    decryption to a container namespace. The following pwru trace shows (1)
    the packet path from the host's XFRM layer to the container's XFRM
    layer where it's dropped and (2) the number of active skb extensions at
    each function.
    
        NETNS       MARK  IFACE  TUPLE                                FUNC
        4026533547  d00   eth0   10.244.3.124:35473->10.244.2.158:53  xfrm_rcv_cb
                                 .active_extensions = (__u8)2,
        4026533547  d00   eth0   10.244.3.124:35473->10.244.2.158:53  xfrm4_rcv_cb
                                 .active_extensions = (__u8)2,
        4026533547  d00   eth0   10.244.3.124:35473->10.244.2.158:53  gro_cells_receive
                                 .active_extensions = (__u8)2,
        [...]
        4026533547  0     eth0   10.244.3.124:35473->10.244.2.158:53  skb_do_redirect
                                 .active_extensions = (__u8)2,
        4026534999  0     eth0   10.244.3.124:35473->10.244.2.158:53  ip_rcv
                                 .active_extensions = (__u8)2,
        4026534999  0     eth0   10.244.3.124:35473->10.244.2.158:53  ip_rcv_core
                                 .active_extensions = (__u8)2,
        [...]
        4026534999  0     eth0   10.244.3.124:35473->10.244.2.158:53  udp_queue_rcv_one_skb
                                 .active_extensions = (__u8)2,
        4026534999  0     eth0   10.244.3.124:35473->10.244.2.158:53  __xfrm_policy_check
                                 .active_extensions = (__u8)2,
        4026534999  0     eth0   10.244.3.124:35473->10.244.2.158:53  __xfrm_decode_session
                                 .active_extensions = (__u8)2,
        4026534999  0     eth0   10.244.3.124:35473->10.244.2.158:53  security_xfrm_decode_session
                                 .active_extensions = (__u8)2,
        4026534999  0     eth0   10.244.3.124:35473->10.244.2.158:53  kfree_skb_reason(SKB_DROP_REASON_XFRM_POLICY)
                                 .active_extensions = (__u8)2,
    
    In this case, there are no XFRM policies in the container's network
    namespace so the drop is unexpected. When we decrypt the IPsec packet,
    the XFRM state used for decryption is set in the skb extensions. This
    information is preserved across the netns switch. When we reach the
    XFRM policy check in the container's netns, __xfrm_policy_check drops
    the packet with LINUX_MIB_XFRMINNOPOLS because a (container-side) XFRM
    policy can't be found that matches the (host-side) XFRM state used for
    decryption.
    
    This patch fixes this by scrubbing the packet when using
    bpf_redirect_peer, as is done on typical netns switches via veth
    devices except skb->mark and skb->tstamp are not zeroed.
    
    Fixes: 9aa1206e8f482 ("bpf: Add redirect_peer helper")
    Signed-off-by: Paul Chaignon <paul.chaignon@gmail.com>
    Acked-by: Daniel Borkmann <daniel@iogearbox.net>
    Acked-by: Martin KaFai Lau <martin.lau@kernel.org>
    Link: https://patch.msgid.link/1728ead5e0fe45e7a6542c36bd4e3ca07a73b7d6.1746460653.git.paul.chaignon@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

can: gw: fix RCU/BH usage in cgw_create_job() [+ + +]

Author: Oliver Hartkopp <socketcan@hartkopp.net>
Date:   Tue Apr 29 09:05:55 2025 +0200

    can: gw: fix RCU/BH usage in cgw_create_job()
    
    [ Upstream commit 511e64e13d8cc72853275832e3f372607466c18c ]
    
    As reported by Sebastian Andrzej Siewior the use of local_bh_disable()
    is only feasible in uni processor systems to update the modification rules.
    The usual use-case to update the modification rules is to update the data
    of the modifications but not the modification types (AND/OR/XOR/SET) or
    the checksum functions itself.
    
    To omit additional memory allocations to maintain fast modification
    switching times, the modification description space is doubled at gw-job
    creation time so that only the reference to the active modification
    description is changed under rcu protection.
    
    Rename cgw_job::mod to cf_mod and make it a RCU pointer. Allocate in
    cgw_create_job() and free it together with cgw_job in
    cgw_job_free_rcu(). Update all users to dereference cgw_job::cf_mod with
    a RCU accessor and if possible once.
    
    [bigeasy: Replace mod1/mod2 from the Oliver's original patch with dynamic
    allocation, use RCU annotation and accessor]
    
    Reported-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
    Closes: https://lore.kernel.org/linux-can/20231031112349.y0aLoBrz@linutronix.de/
    Fixes: dd895d7f21b2 ("can: cangw: introduce optional uid to reference created routing jobs")
    Tested-by: Oliver Hartkopp <socketcan@hartkopp.net>
    Signed-off-by: Oliver Hartkopp <socketcan@hartkopp.net>
    Signed-off-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
    Link: https://patch.msgid.link/20250429070555.cs-7b_eZ@linutronix.de
    Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

can: mcan: m_can_class_unregister(): fix order of unregistration calls [+ + +]

Author: Marc Kleine-Budde <mkl@pengutronix.de>
Date:   Fri May 2 16:13:46 2025 +0200

    can: mcan: m_can_class_unregister(): fix order of unregistration calls
    
    commit 0713a1b3276b98c7dafbeefef00d7bc3a9119a84 upstream.
    
    If a driver is removed, the driver framework invokes the driver's
    remove callback. A CAN driver's remove function calls
    unregister_candev(), which calls net_device_ops::ndo_stop further down
    in the call stack for interfaces which are in the "up" state.
    
    The removal of the module causes a warning, as can_rx_offload_del()
    deletes the NAPI, while it is still active, because the interface is
    still up.
    
    To fix the warning, first unregister the network interface, which
    calls net_device_ops::ndo_stop, which disables the NAPI, and then call
    can_rx_offload_del().
    
    Fixes: 1be37d3b0414 ("can: m_can: fix periph RX path: use rx-offload to ensure skbs are sent from softirq context")
    Cc: stable@vger.kernel.org
    Link: https://patch.msgid.link/20250502-can-rx-offload-del-v1-3-59a9b131589d@pengutronix.de
    Reviewed-by: Markus Schneider-Pargmann <msp@baylibre.com>
    Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

can: mcp251xfd: fix TDC setting for low data bit rates [+ + +]

Author: Kelsey Maes <kelsey@vpprocess.com>
Date:   Wed Apr 30 09:15:01 2025 -0700

    can: mcp251xfd: fix TDC setting for low data bit rates
    
    [ Upstream commit 5e1663810e11c64956aa7e280cf74b2f3284d816 ]
    
    The TDC is currently hardcoded enabled. This means that even for lower
    CAN-FD data bitrates (with a DBRP (data bitrate prescaler) > 2) a TDC
    is configured. This leads to a bus-off condition.
    
    ISO 11898-1 section 11.3.3 says "Transmitter delay compensation" (TDC)
    is only applicable if DBRP is 1 or 2.
    
    To fix the problem, switch the driver to use the TDC calculation
    provided by the CAN driver framework (which respects ISO 11898-1
    section 11.3.3). This has the positive side effect that userspace can
    control TDC as needed.
    
    Demonstration of the feature in action:
    | $ ip link set can0 up type can bitrate 125000 dbitrate 500000 fd on
    | $ ip -details link show can0
    | 3: can0: <NOARP,UP,LOWER_UP,ECHO> mtu 72 qdisc pfifo_fast state UP mode DEFAULT group default qlen 10
    |     link/can  promiscuity 0  allmulti 0 minmtu 0 maxmtu 0
    |     can <FD> state ERROR-ACTIVE (berr-counter tx 0 rx 0) restart-ms 0
    |         bitrate 125000 sample-point 0.875
    |         tq 50 prop-seg 69 phase-seg1 70 phase-seg2 20 sjw 10 brp 2
    |         mcp251xfd: tseg1 2..256 tseg2 1..128 sjw 1..128 brp 1..256 brp_inc 1
    |         dbitrate 500000 dsample-point 0.875
    |         dtq 125 dprop-seg 6 dphase-seg1 7 dphase-seg2 2 dsjw 1 dbrp 5
    |         mcp251xfd: dtseg1 1..32 dtseg2 1..16 dsjw 1..16 dbrp 1..256 dbrp_inc 1
    |         tdcv 0..63 tdco 0..63
    |         clock 40000000 numtxqueues 1 numrxqueues 1 gso_max_size 65536 gso_max_segs 65535 tso_max_size 65536 tso_max_segs 65535 gro_max_size 65536 parentbus spi parentdev spi0.0
    | $ ip link set can0 up type can bitrate 1000000 dbitrate 4000000 fd on
    | $ ip -details link show can0
    | 3: can0: <NOARP,UP,LOWER_UP,ECHO> mtu 72 qdisc pfifo_fast state UP mode DEFAULT group default qlen 10
    |     link/can  promiscuity 0  allmulti 0 minmtu 0 maxmtu 0
    |     can <FD,TDC-AUTO> state ERROR-ACTIVE (berr-counter tx 0 rx 0) restart-ms 0
    |         bitrate 1000000 sample-point 0.750
    |         tq 25 prop-seg 14 phase-seg1 15 phase-seg2 10 sjw 5 brp 1
    |         mcp251xfd: tseg1 2..256 tseg2 1..128 sjw 1..128 brp 1..256 brp_inc 1
    |         dbitrate 4000000 dsample-point 0.700
    |         dtq 25 dprop-seg 3 dphase-seg1 3 dphase-seg2 3 dsjw 1 dbrp 1
    |         tdco 7
    |         mcp251xfd: dtseg1 1..32 dtseg2 1..16 dsjw 1..16 dbrp 1..256 dbrp_inc 1
    |         tdcv 0..63 tdco 0..63
    |         clock 40000000 numtxqueues 1 numrxqueues 1 gso_max_size 65536 gso_max_segs 65535 tso_max_size 65536 tso_max_segs 65535 gro_max_size 65536 parentbus spi parentdev spi0.0
    
    There has been some confusion about the MCP2518FD using a relative or
    absolute TDCO due to the datasheet specifying a range of [-64,63]. I
    have a custom board with a 40 MHz clock and an estimated loop delay of
    100 to 216 ns. During testing at a data bit rate of 4 Mbit/s I found
    that using can_get_relative_tdco() resulted in bus-off errors. The
    final TDCO value was 1 which corresponds to a 10% SSP in an absolute
    configuration. This behavior is expected if the TDCO value is really
    absolute and not relative. Using priv->can.tdc.tdco instead results in
    a final TDCO of 8, setting the SSP at exactly 80%. This configuration
    works.
    
    The automatic, manual, and off TDC modes were tested at speeds up to,
    and including, 8 Mbit/s on real hardware and behave as expected.
    
    Fixes: 55e5b97f003e ("can: mcp25xxfd: add driver for Microchip MCP25xxFD SPI CAN")
    Reported-by: Kelsey Maes <kelsey@vpprocess.com>
    Closes: https://lore.kernel.org/all/C2121586-C87F-4B23-A933-845362C29CA1@vpprocess.com
    Reviewed-by: Vincent Mailhol <mailhol.vincent@wanadoo.fr>
    Signed-off-by: Kelsey Maes <kelsey@vpprocess.com>
    Link: https://patch.msgid.link/20250430161501.79370-1-kelsey@vpprocess.com
    [mkl: add comment]
    Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

can: mcp251xfd: mcp251xfd_remove(): fix order of unregistration calls [+ + +]

Author: Marc Kleine-Budde <mkl@pengutronix.de>
Date:   Fri May 2 16:13:44 2025 +0200

    can: mcp251xfd: mcp251xfd_remove(): fix order of unregistration calls
    
    commit 84f5eb833f53ae192baed4cfb8d9eaab43481fc9 upstream.
    
    If a driver is removed, the driver framework invokes the driver's
    remove callback. A CAN driver's remove function calls
    unregister_candev(), which calls net_device_ops::ndo_stop further down
    in the call stack for interfaces which are in the "up" state.
    
    With the mcp251xfd driver the removal of the module causes the
    following warning:
    
    | WARNING: CPU: 0 PID: 352 at net/core/dev.c:7342 __netif_napi_del_locked+0xc8/0xd8
    
    as can_rx_offload_del() deletes the NAPI, while it is still active,
    because the interface is still up.
    
    To fix the warning, first unregister the network interface, which
    calls net_device_ops::ndo_stop, which disables the NAPI, and then call
    can_rx_offload_del().
    
    Fixes: 55e5b97f003e ("can: mcp25xxfd: add driver for Microchip MCP25xxFD SPI CAN")
    Cc: stable@vger.kernel.org
    Link: https://patch.msgid.link/20250502-can-rx-offload-del-v1-1-59a9b131589d@pengutronix.de
    Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dm: add missing unlock on in dm_keyslot_evict() [+ + +]

Author: Dan Carpenter <dan.carpenter@linaro.org>
Date:   Wed Apr 30 11:05:54 2025 +0300

    dm: add missing unlock on in dm_keyslot_evict()
    
    commit 650266ac4c7230c89bcd1307acf5c9c92cfa85e2 upstream.
    
    We need to call dm_put_live_table() even if dm_get_live_table() returns
    NULL.
    
    Fixes: 9355a9eb21a5 ("dm: support key eviction from keyslot managers of underlying devices")
    Cc: stable@vger.kernel.org      # v5.12+
    Signed-off-by: Dan Carpenter <dan.carpenter@linaro.org>
    Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

do_umount(): add missing barrier before refcount checks in sync case [+ + +]

Author: Al Viro <viro@zeniv.linux.org.uk>
Date:   Mon Apr 28 23:56:14 2025 -0400

    do_umount(): add missing barrier before refcount checks in sync case
    
    [ Upstream commit 65781e19dcfcb4aed1167d87a3ffcc2a0c071d47 ]
    
    do_umount() analogue of the race fixed in 119e1ef80ecf "fix
    __legitimize_mnt()/mntput() race".  Here we want to make sure that
    if __legitimize_mnt() doesn't notice our lock_mount_hash(), we will
    notice their refcount increment.  Harder to hit than mntput_no_expire()
    one, fortunately, and consequences are milder (sync umount acting
    like umount -l on a rare race with RCU pathwalk hitting at just the
    wrong time instead of use-after-free galore mntput_no_expire()
    counterpart used to be hit).  Still a bug...
    
    Fixes: 48a066e72d97 ("RCU'd vfsmounts")
    Reviewed-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Documentation: x86/bugs/its: Add ITS documentation [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Fri Apr 11 15:36:38 2025 -0700

    Documentation: x86/bugs/its: Add ITS documentation
    
    commit 1ac116ce6468670eeda39345a5585df308243dca upstream.
    
    Add the admin-guide for Indirect Target Selection (ITS).
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Copy AUX read reply data whenever length > 0 [+ + +]

Author: Wayne Lin <Wayne.Lin@amd.com>
Date:   Sun Apr 20 17:50:03 2025 +0800

    drm/amd/display: Copy AUX read reply data whenever length > 0
    
    commit 3924f45d4de7250a603fd7b50379237a6a0e5adf upstream.
    
    [Why]
    amdgpu_dm_process_dmub_aux_transfer_sync() should return all exact data
    reply from the sink side. Don't do the analysis job in it.
    
    [How]
    Remove unnecessary check condition AUX_TRANSACTION_REPLY_AUX_ACK.
    
    Fixes: ead08b95fa50 ("drm/amd/display: Fix race condition in DPIA AUX transfer")
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Cc: Alex Deucher <alexander.deucher@amd.com>
    Reviewed-by: Ray Wu <ray.wu@amd.com>
    Signed-off-by: Wayne Lin <Wayne.Lin@amd.com>
    Signed-off-by: Ray Wu <ray.wu@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 9b540e3fe6796fec4fb1344f3be8952fc2f084d4)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Fix the checking condition in dmub aux handling [+ + +]

Author: Wayne Lin <Wayne.Lin@amd.com>
Date:   Sun Apr 20 16:29:07 2025 +0800

    drm/amd/display: Fix the checking condition in dmub aux handling
    
    commit bc70e11b550d37fbd9eaed0f113ba560894f1609 upstream.
    
    [Why & How]
    Fix the checking condition for detecting AUX_RET_ERROR_PROTOCOL_ERROR.
    It was wrongly checking by "not equals to"
    
    Reviewed-by: Ray Wu <ray.wu@amd.com>
    Signed-off-by: Wayne Lin <Wayne.Lin@amd.com>
    Signed-off-by: Ray Wu <ray.wu@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 1db6c9e9b62e1a8912f0a281c941099fca678da3)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Fix wrong handling for AUX_DEFER case [+ + +]

Author: Wayne Lin <Wayne.Lin@amd.com>
Date:   Sun Apr 20 19:22:14 2025 +0800

    drm/amd/display: Fix wrong handling for AUX_DEFER case
    
    commit 65924ec69b29296845c7f628112353438e63ea56 upstream.
    
    [Why]
    We incorrectly ack all bytes get written when the reply actually is defer.
    When it's defer, means sink is not ready for the request. We should
    retry the request.
    
    [How]
    Only reply all data get written when receive I2C_ACK|AUX_ACK. Otherwise,
    reply the number of actual written bytes received from the sink.
    Add some messages to facilitate debugging as well.
    
    Fixes: ad6756b4d773 ("drm/amd/display: Shift dc link aux to aux_payload")
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Cc: Alex Deucher <alexander.deucher@amd.com>
    Reviewed-by: Ray Wu <ray.wu@amd.com>
    Signed-off-by: Wayne Lin <Wayne.Lin@amd.com>
    Signed-off-by: Ray Wu <ray.wu@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 3637e457eb0000bc37d8bbbec95964aad2fb29fd)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Remove incorrect checking in dmub aux handler [+ + +]

Author: Wayne Lin <Wayne.Lin@amd.com>
Date:   Sun Apr 20 16:56:54 2025 +0800

    drm/amd/display: Remove incorrect checking in dmub aux handler
    
    commit 396dc51b3b7ea524bf8061f478332d0039e96d5d upstream.
    
    [Why & How]
    "Request length != reply length" is expected behavior defined in spec.
    It's not an invalid reply. Besides, replied data handling logic is not
    designed to be written in amdgpu_dm_process_dmub_aux_transfer_sync().
    Remove the incorrectly handling section.
    
    Fixes: ead08b95fa50 ("drm/amd/display: Fix race condition in DPIA AUX transfer")
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Cc: Alex Deucher <alexander.deucher@amd.com>
    Reviewed-by: Ray Wu <ray.wu@amd.com>
    Signed-off-by: Wayne Lin <Wayne.Lin@amd.com>
    Signed-off-by: Ray Wu <ray.wu@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 81b5c6fa62af62fe89ae9576f41aae37830b94cb)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Shift DMUB AUX reply command if necessary [+ + +]

Author: Wayne Lin <Wayne.Lin@amd.com>
Date:   Fri Apr 18 16:31:59 2025 +0800

    drm/amd/display: Shift DMUB AUX reply command if necessary
    
    commit 5a3846648c0523fd850b7f0aec78c0139453ab8b upstream.
    
    [Why]
    Defined value of dmub AUX reply command field get updated but didn't
    adjust dm receiving side accordingly.
    
    [How]
    Check the received reply command value to see if it's updated version
    or not. Adjust it if necessary.
    
    Fixes: ead08b95fa50 ("drm/amd/display: Fix race condition in DPIA AUX transfer")
    Cc: Mario Limonciello <mario.limonciello@amd.com>
    Cc: Alex Deucher <alexander.deucher@amd.com>
    Reviewed-by: Ray Wu <ray.wu@amd.com>
    Signed-off-by: Wayne Lin <Wayne.Lin@amd.com>
    Signed-off-by: Ray Wu <ray.wu@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit d5c9ade755a9afa210840708a12a8f44c0d532f4)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amdgpu/hdp5.2: use memcfg register to post the write for HDP flush [+ + +]

Author: Alex Deucher <alexander.deucher@amd.com>
Date:   Wed Apr 30 12:47:37 2025 -0400

    drm/amdgpu/hdp5.2: use memcfg register to post the write for HDP flush
    
    commit dbc988c689333faeeed44d5561f372ff20395304 upstream.
    
    Reading back the remapped HDP flush register seems to cause
    problems on some platforms. All we need is a read, so read back
    the memcfg register.
    
    Fixes: f756dbac1ce1 ("drm/amdgpu/hdp5.2: do a posting read when flushing HDP")
    Reported-by: Alexey Klimov <alexey.klimov@linaro.org>
    Link: https://lists.freedesktop.org/archives/amd-gfx/2025-April/123150.html
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4119
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/3908
    Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 4a89b7698e771914b4d5b571600c76e2fdcbe2a9)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/panel: simple: Update timings for AUO G101EVN010 [+ + +]

Author: Kevin Baker <kevinb@ventureresearch.com>
Date:   Mon May 5 12:02:56 2025 -0500

    drm/panel: simple: Update timings for AUO G101EVN010
    
    [ Upstream commit 7c6fa1797a725732981f2d77711c867166737719 ]
    
    Switch to panel timings based on datasheet for the AUO G101EVN01.0
    LVDS panel. Default timings were tested on the panel.
    
    Previous mode-based timings resulted in horizontal display shift.
    
    Signed-off-by: Kevin Baker <kevinb@ventureresearch.com>
    Fixes: 4fb86404a977 ("drm/panel: simple: Add AUO G101EVN010 panel support")
    Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org>
    Link: https://lore.kernel.org/r/20250505170256.1385113-1-kevinb@ventureresearch.com
    Signed-off-by: Neil Armstrong <neil.armstrong@linaro.org>
    Link: https://lore.kernel.org/r/20250505170256.1385113-1-kevinb@ventureresearch.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/v3d: Add job to pending list if the reset was skipped [+ + +]

Author: Maíra Canal <mcanal@igalia.com>
Date:   Wed Apr 30 17:51:52 2025 -0300

    drm/v3d: Add job to pending list if the reset was skipped
    
    commit 35e4079bf1a2570abffce6ababa631afcf8ea0e5 upstream.
    
    When a CL/CSD job times out, we check if the GPU has made any progress
    since the last timeout. If so, instead of resetting the hardware, we skip
    the reset and let the timer get rearmed. This gives long-running jobs a
    chance to complete.
    
    However, when `timedout_job()` is called, the job in question is removed
    from the pending list, which means it won't be automatically freed through
    `free_job()`. Consequently, when we skip the reset and keep the job
    running, the job won't be freed when it finally completes.
    
    This situation leads to a memory leak, as exposed in [1] and [2].
    
    Similarly to commit 704d3d60fec4 ("drm/etnaviv: don't block scheduler when
    GPU is still active"), this patch ensures the job is put back on the
    pending list when extending the timeout.
    
    Cc: stable@vger.kernel.org # 6.0
    Reported-by: Daivik Bhatia <dtgs1208@gmail.com>
    Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12227 [1]
    Closes: https://github.com/raspberrypi/linux/issues/6817 [2]
    Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
    Acked-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
    Link: https://lore.kernel.org/r/20250430210643.57924-1-mcanal@igalia.com
    Signed-off-by: Maíra Canal <mcanal@igalia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

gre: Fix again IPv6 link-local address generation. [+ + +]

Author: Guillaume Nault <gnault@redhat.com>
Date:   Sat May 3 00:57:52 2025 +0200

    gre: Fix again IPv6 link-local address generation.
    
    [ Upstream commit 3e6a0243ff002ddbd7ee18a8974ae61d2e6ed00d ]
    
    Use addrconf_addr_gen() to generate IPv6 link-local addresses on GRE
    devices in most cases and fall back to using add_v4_addrs() only in
    case the GRE configuration is incompatible with addrconf_addr_gen().
    
    GRE used to use addrconf_addr_gen() until commit e5dd729460ca ("ip/ip6_gre:
    use the same logic as SIT interfaces when computing v6LL address")
    restricted this use to gretap and ip6gretap devices, and created
    add_v4_addrs() (borrowed from SIT) for non-Ethernet GRE ones.
    
    The original problem came when commit 9af28511be10 ("addrconf: refuse
    isatap eui64 for INADDR_ANY") made __ipv6_isatap_ifid() fail when its
    addr parameter was 0. The commit says that this would create an invalid
    address, however, I couldn't find any RFC saying that the generated
    interface identifier would be wrong. Anyway, since gre over IPv4
    devices pass their local tunnel address to __ipv6_isatap_ifid(), that
    commit broke their IPv6 link-local address generation when the local
    address was unspecified.
    
    Then commit e5dd729460ca ("ip/ip6_gre: use the same logic as SIT
    interfaces when computing v6LL address") tried to fix that case by
    defining add_v4_addrs() and calling it to generate the IPv6 link-local
    address instead of using addrconf_addr_gen() (apart for gretap and
    ip6gretap devices, which would still use the regular
    addrconf_addr_gen(), since they have a MAC address).
    
    That broke several use cases because add_v4_addrs() isn't properly
    integrated into the rest of IPv6 Neighbor Discovery code. Several of
    these shortcomings have been fixed over time, but add_v4_addrs()
    remains broken on several aspects. In particular, it doesn't send any
    Router Sollicitations, so the SLAAC process doesn't start until the
    interface receives a Router Advertisement. Also, add_v4_addrs() mostly
    ignores the address generation mode of the interface
    (/proc/sys/net/ipv6/conf/*/addr_gen_mode), thus breaking the
    IN6_ADDR_GEN_MODE_RANDOM and IN6_ADDR_GEN_MODE_STABLE_PRIVACY cases.
    
    Fix the situation by using add_v4_addrs() only in the specific scenario
    where the normal method would fail. That is, for interfaces that have
    all of the following characteristics:
    
      * run over IPv4,
      * transport IP packets directly, not Ethernet (that is, not gretap
        interfaces),
      * tunnel endpoint is INADDR_ANY (that is, 0),
      * device address generation mode is EUI64.
    
    In all other cases, revert back to the regular addrconf_addr_gen().
    
    Also, remove the special case for ip6gre interfaces in add_v4_addrs(),
    since ip6gre devices now always use addrconf_addr_gen() instead.
    
    Note:
      This patch was originally applied as commit 183185a18ff9 ("gre: Fix
      IPv6 link-local address generation."). However, it was then reverted
      by commit fc486c2d060f ("Revert "gre: Fix IPv6 link-local address
      generation."") because it uncovered another bug that ended up
      breaking net/forwarding/ip6gre_custom_multipath_hash.sh. That other
      bug has now been fixed by commit 4d0ab3a6885e ("ipv6: Start path
      selection from the first nexthop"). Therefore we can now revive this
      GRE patch (no changes since original commit 183185a18ff9 ("gre: Fix
      IPv6 link-local address generation.").
    
    Fixes: e5dd729460ca ("ip/ip6_gre: use the same logic as SIT interfaces when computing v6LL address")
    Signed-off-by: Guillaume Nault <gnault@redhat.com>
    Reviewed-by: Ido Schimmel <idosch@nvidia.com>
    Link: https://patch.msgid.link/a88cc5c4811af36007645d610c95102dccb360a6.1746225214.git.gnault@redhat.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iio: accel: adxl355: Make timestamp 64-bit aligned using aligned_s64 [+ + +]

Author: Jonathan Cameron <Jonathan.Cameron@huawei.com>
Date:   Sun Apr 13 11:34:27 2025 +0100

    iio: accel: adxl355: Make timestamp 64-bit aligned using aligned_s64
    
    [ Upstream commit 1bb942287e05dc4c304a003ea85e6dd9a5e7db39 ]
    
    The IIO ABI requires 64-bit aligned timestamps. In this case insufficient
    padding would have been added on architectures where an s64 is only 32-bit
    aligned.  Use aligned_s64 to enforce the correct alignment.
    
    Fixes: 327a0eaf19d5 ("iio: accel: adxl355: Add triggered buffer support")
    Reported-by: David Lechner <dlechner@baylibre.com>
    Reviewed-by: Nuno Sá <nuno.sa@analog.com>
    Reviewed-by: David Lechner <dlechner@baylibre.com>
    Link: https://patch.msgid.link/20250413103443.2420727-5-jic23@kernel.org
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iio: accel: adxl367: fix setting odr for activity time update [+ + +]

Author: Lothar Rubusch <l.rubusch@gmail.com>
Date:   Sun Mar 9 19:35:15 2025 +0000

    iio: accel: adxl367: fix setting odr for activity time update
    
    [ Upstream commit 38f67d0264929762e54ae5948703a21f841fe706 ]
    
    Fix setting the odr value to update activity time based on frequency
    derrived by recent odr, and not by obsolete odr value.
    
    The [small] bug: When _adxl367_set_odr() is called with a new odr value,
    it first writes the new odr value to the hardware register
    ADXL367_REG_FILTER_CTL.
    Second, it calls _adxl367_set_act_time_ms(), which calls
    adxl367_time_ms_to_samples(). Here st->odr still holds the old odr value.
    This st->odr member is used to derrive a frequency value, which is
    applied to update ADXL367_REG_TIME_ACT. Hence, the idea is to update
    activity time, based on possibilities and power consumption by the
    current ODR rate.
    Finally, when the function calls return, again in _adxl367_set_odr() the
    new ODR is assigned to st->odr.
    
    The fix: When setting a new ODR value is set to ADXL367_REG_FILTER_CTL,
    also ADXL367_REG_TIME_ACT should probably be updated with a frequency
    based on the recent ODR value and not the old one. Changing the location
    of the assignment to st->odr fixes this.
    
    Fixes: cbab791c5e2a5 ("iio: accel: add ADXL367 driver")
    Signed-off-by: Lothar Rubusch <l.rubusch@gmail.com>
    Reviewed-by: Marcelo Schmitt <marcelo.schmitt1@gmail.com>
    Link: https://patch.msgid.link/20250309193515.2974-1-l.rubusch@gmail.com
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iio: adc: ad7606: fix serial register access [+ + +]

Author: Angelo Dureghello <adureghello@baylibre.com>
Date:   Fri Apr 18 20:37:53 2025 +0200

    iio: adc: ad7606: fix serial register access
    
    commit f083f8a21cc785ebe3a33f756a3fa3660611f8db upstream.
    
    Fix register read/write routine as per datasheet.
    
    When reading multiple consecutive registers, only the first one is read
    properly. This is due to missing chip select deassert and assert again
    between first and second 16bit transfer, as shown in the datasheet
    AD7606C-16, rev 0, figure 110.
    
    Fixes: f2a22e1e172f ("iio: adc: ad7606: Add support for software mode for ad7616")
    Reviewed-by: David Lechner <dlechner@baylibre.com>
    Signed-off-by: Angelo Dureghello <adureghello@baylibre.com>
    Link: https://patch.msgid.link/20250418-wip-bl-ad7606-fix-reg-access-v3-1-d5eeb440c738@baylibre.com
    Cc: <Stable@vger.kernel.org>
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iio: adc: dln2: Use aligned_s64 for timestamp [+ + +]

Author: Jonathan Cameron <Jonathan.Cameron@huawei.com>
Date:   Sun Apr 13 11:34:26 2025 +0100

    iio: adc: dln2: Use aligned_s64 for timestamp
    
    [ Upstream commit 5097eaae98e53f9ab9d35801c70da819b92ca907 ]
    
    Here the lack of marking allows the overall structure to not be
    sufficiently aligned resulting in misplacement of the timestamp
    in iio_push_to_buffers_with_timestamp(). Use aligned_s64 to
    force the alignment on all architectures.
    
    Fixes: 7c0299e879dd ("iio: adc: Add support for DLN2 ADC")
    Reported-by: David Lechner <dlechner@baylibre.com>
    Reviewed-by: Andy Shevchenko <andy@kernel.org>
    Reviewed-by: Nuno Sá <nuno.sa@analog.com>
    Reviewed-by: David Lechner <dlechner@baylibre.com>
    Link: https://patch.msgid.link/20250413103443.2420727-4-jic23@kernel.org
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iio: adis16201: Correct inclinometer channel resolution [+ + +]

Author: Gabriel Shahrouzi <gshahrouzi@gmail.com>
Date:   Mon Apr 21 09:15:39 2025 -0400

    iio: adis16201: Correct inclinometer channel resolution
    
    commit 609bc31eca06c7408e6860d8b46311ebe45c1fef upstream.
    
    The inclinometer channels were previously defined with 14 realbits.
    However, the ADIS16201 datasheet states the resolution for these output
    channels is 12 bits (Page 14, text description; Page 15, table 7).
    
    Correct the realbits value to 12 to accurately reflect the hardware.
    
    Fixes: f7fe1d1dd5a5 ("staging: iio: new adis16201 driver")
    Cc: stable@vger.kernel.org
    Signed-off-by: Gabriel Shahrouzi <gshahrouzi@gmail.com>
    Reviewed-by: Marcelo Schmitt <marcelo.schmitt1@gmail.com>
    Link: https://patch.msgid.link/20250421131539.912966-1-gshahrouzi@gmail.com
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iio: imu: st_lsm6dsx: fix possible lockup in st_lsm6dsx_read_fifo [+ + +]

Author: Silvano Seva <s.seva@4sigma.it>
Date:   Tue Mar 11 09:49:47 2025 +0100

    iio: imu: st_lsm6dsx: fix possible lockup in st_lsm6dsx_read_fifo
    
    commit 159ca7f18129834b6f4c7eae67de48e96c752fc9 upstream.
    
    Prevent st_lsm6dsx_read_fifo from falling in an infinite loop in case
    pattern_len is equal to zero and the device FIFO is not empty.
    
    Fixes: 290a6ce11d93 ("iio: imu: add support to lsm6dsx driver")
    Signed-off-by: Silvano Seva <s.seva@4sigma.it>
    Acked-by: Lorenzo Bianconi <lorenzo@kernel.org>
    Link: https://patch.msgid.link/20250311085030.3593-2-s.seva@4sigma.it
    Cc: <Stable@vger.kernel.org>
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iio: imu: st_lsm6dsx: fix possible lockup in st_lsm6dsx_read_tagged_fifo [+ + +]

Author: Silvano Seva <s.seva@4sigma.it>
Date:   Tue Mar 11 09:49:49 2025 +0100

    iio: imu: st_lsm6dsx: fix possible lockup in st_lsm6dsx_read_tagged_fifo
    
    commit 8114ef86e2058e2554111b793596f17bee23fa15 upstream.
    
    Prevent st_lsm6dsx_read_tagged_fifo from falling in an infinite loop in
    case pattern_len is equal to zero and the device FIFO is not empty.
    
    Fixes: 801a6e0af0c6 ("iio: imu: st_lsm6dsx: add support to LSM6DSO")
    Signed-off-by: Silvano Seva <s.seva@4sigma.it>
    Acked-by: Lorenzo Bianconi <lorenzo@kernel.org>
    Link: https://patch.msgid.link/20250311085030.3593-4-s.seva@4sigma.it
    Cc: <Stable@vger.kernel.org>
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iio: temp: maxim-thermocouple: Fix potential lack of DMA safe buffer. [+ + +]

Author: Jonathan Cameron <Jonathan.Cameron@huawei.com>
Date:   Sun Apr 13 11:34:36 2025 +0100

    iio: temp: maxim-thermocouple: Fix potential lack of DMA safe buffer.
    
    [ Upstream commit f79aeb6c631b57395f37acbfbe59727e355a714c ]
    
    The trick of using __aligned(IIO_DMA_MINALIGN) ensures that there is
    no overlap between buffers used for DMA and those used for driver
    state storage that are before the marking. It doesn't ensure
    anything above state variables found after the marking. Hence
    move this particular bit of state earlier in the structure.
    
    Fixes: 10897f34309b ("iio: temp: maxim_thermocouple: Fix alignment for DMA safety")
    Reviewed-by: David Lechner <dlechner@baylibre.com>
    Link: https://patch.msgid.link/20250413103443.2420727-14-jic23@kernel.org
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Input: mtk-pmic-keys - fix possible null pointer dereference [+ + +]

Author: Gary Bisson <bisson.gary@gmail.com>
Date:   Tue Apr 29 09:16:29 2025 -0700

    Input: mtk-pmic-keys - fix possible null pointer dereference
    
    commit 11cdb506d0fbf5ac05bf55f5afcb3a215c316490 upstream.
    
    In mtk_pmic_keys_probe, the regs parameter is only set if the button is
    parsed in the device tree. However, on hardware where the button is left
    floating, that node will most likely be removed not to enable that
    input. In that case the code will try to dereference a null pointer.
    
    Let's use the regs struct instead as it is defined for all supported
    platforms. Note that it is ok setting the key reg even if that latter is
    disabled as the interrupt won't be enabled anyway.
    
    Fixes: b581acb49aec ("Input: mtk-pmic-keys - transfer per-key bit in mtk_pmic_keys_regs")
    Signed-off-by: Gary Bisson <bisson.gary@gmail.com>
    Cc: stable@vger.kernel.org
    Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Input: synaptics - enable InterTouch on Dell Precision M3800 [+ + +]

Author: Aditya Garg <gargaditya08@live.com>
Date:   Wed May 7 12:12:15 2025 -0700

    Input: synaptics - enable InterTouch on Dell Precision M3800
    
    commit a609cb4cc07aa9ab8f50466622814356c06f2c17 upstream.
    
    Enable InterTouch mode on Dell Precision M3800 by adding "DLL060d" to
    the list of SMBus-enabled variants.
    
    Reported-by: Markus Rathgeb <maggu2810@gmail.com>
    Signed-off-by: Aditya Garg <gargaditya08@live.com>
    Link: https://lore.kernel.org/r/PN3PR01MB959789DD6D574E16141E5DC4B888A@PN3PR01MB9597.INDPRD01.PROD.OUTLOOK.COM
    Cc: stable@vger.kernel.org
    Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Input: synaptics - enable InterTouch on Dynabook Portege X30-D [+ + +]

Author: Manuel Fombuena <fombuena@outlook.com>
Date:   Wed May 7 12:05:26 2025 -0700

    Input: synaptics - enable InterTouch on Dynabook Portege X30-D
    
    commit 6d7ea0881000966607772451b789b5fb5766f11d upstream.
    
    [    5.989588] psmouse serio1: synaptics: Your touchpad (PNP: TOS0213 PNP0f03) says it can support a different bus. If i2c-hid and hid-rmi are not used, you might want to try setting psmouse.synaptics_intertouch to 1 and report this to linux-input@vger.kernel.org.
    [    6.039923] psmouse serio1: synaptics: Touchpad model: 1, fw: 9.32, id: 0x1e2a1, caps: 0xf00223/0x840300/0x12e800/0x52d884, board id: 3322, fw id: 2658004
    
    The board is labelled TM3322.
    
    Present on the Toshiba / Dynabook Portege X30-D and possibly others.
    
    Confirmed working well with psmouse.synaptics_intertouch=1 and local build.
    
    Signed-off-by: Manuel Fombuena <fombuena@outlook.com>
    Signed-off-by: Aditya Garg <gargaditya08@live.com>
    Link: https://lore.kernel.org/r/PN3PR01MB9597711E7933A08389FEC31DB888A@PN3PR01MB9597.INDPRD01.PROD.OUTLOOK.COM
    Cc: stable@vger.kernel.org
    Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Input: synaptics - enable InterTouch on Dynabook Portege X30L-G [+ + +]

Author: Aditya Garg <gargaditya08@live.com>
Date:   Wed May 7 12:06:32 2025 -0700

    Input: synaptics - enable InterTouch on Dynabook Portege X30L-G
    
    commit 47d768b32e644b56901bb4bbbdb1feb01ea86c85 upstream.
    
    Enable InterTouch mode on Dynabook Portege X30L-G by adding "TOS01f6" to
    the list of SMBus-enabled variants.
    
    Reported-by: Xuntao Chi <chotaotao1qaz2wsx@gmail.com>
    Tested-by: Xuntao Chi <chotaotao1qaz2wsx@gmail.com>
    Signed-off-by: Aditya Garg <gargaditya08@live.com>
    Link: https://lore.kernel.org/r/PN3PR01MB959786E4AC797160CDA93012B888A@PN3PR01MB9597.INDPRD01.PROD.OUTLOOK.COM
    Cc: stable@vger.kernel.org
    Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Input: synaptics - enable InterTouch on TUXEDO InfinityBook Pro 14 v5 [+ + +]

Author: Aditya Garg <gargaditya08@live.com>
Date:   Wed May 7 12:09:00 2025 -0700

    Input: synaptics - enable InterTouch on TUXEDO InfinityBook Pro 14 v5
    
    commit 2abc698ac77314e0de5b33a6d96a39c5159d88e4 upstream.
    
    Enable InterTouch mode on TUXEDO InfinityBook Pro 14 v5 by adding
    "SYN1221" to the list of SMBus-enabled variants.
    
    Add support for InterTouch on SYN1221 by adding it to the list of
    SMBus-enabled variants.
    
    Reported-by: Matthias Eilert <kernel.hias@eilert.tech>
    Tested-by: Matthias Eilert <kernel.hias@eilert.tech>
    Signed-off-by: Aditya Garg <gargaditya08@live.com>
    Link: https://lore.kernel.org/r/PN3PR01MB9597C033C4BC20EE2A0C4543B888A@PN3PR01MB9597.INDPRD01.PROD.OUTLOOK.COM
    Cc: stable@vger.kernel.org
    Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Input: synaptics - enable SMBus for HP Elitebook 850 G1 [+ + +]

Author: Dmitry Torokhov <dmitry.torokhov@gmail.com>
Date:   Wed May 7 14:52:55 2025 -0700

    Input: synaptics - enable SMBus for HP Elitebook 850 G1
    
    commit f04f03d3e99bc8f89b6af5debf07ff67d961bc23 upstream.
    
    The kernel reports that the touchpad for this device can support
    SMBus mode.
    
    Reported-by: jt <enopatch@gmail.com>
    Link: https://lore.kernel.org/r/iys5dbv3ldddsgobfkxldazxyp54kay4bozzmagga6emy45jop@2ebvuxgaui4u
    Cc: stable@vger.kernel.org
    Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring: always arm linked timeouts prior to issue [+ + +]

Author: Jens Axboe <axboe@kernel.dk>
Date:   Mon May 5 08:34:39 2025 -0600

    io_uring: always arm linked timeouts prior to issue
    
    Commit b53e523261bf058ea4a518b482222e7a277b186b upstream.
    
    There are a few spots where linked timeouts are armed, and not all of
    them adhere to the pre-arm, attempt issue, post-arm pattern. This can
    be problematic if the linked request returns that it will trigger a
    callback later, and does so before the linked timeout is fully armed.
    
    Consolidate all the linked timeout handling into __io_issue_sqe(),
    rather than have it spread throughout the various issue entry points.
    
    Cc: stable@vger.kernel.org
    Link: https://github.com/axboe/liburing/issues/1390
    Reported-by: Chase Hiltz <chase@path.net>
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring: ensure deferred completions are posted for multishot [+ + +]

Author: Jens Axboe <axboe@kernel.dk>
Date:   Wed May 7 08:07:09 2025 -0600

    io_uring: ensure deferred completions are posted for multishot
    
    Commit 687b2bae0efff9b25e071737d6af5004e6e35af5 upstream.
    
    Multishot normally uses io_req_post_cqe() to post completions, but when
    stopping it, it may finish up with a deferred completion. This is fine,
    except if another multishot event triggers before the deferred completions
    get flushed. If this occurs, then CQEs may get reordered in the CQ ring,
    and cause confusion on the application side.
    
    When multishot posting via io_req_post_cqe(), flush any pending deferred
    completions first, if any.
    
    Cc: stable@vger.kernel.org # 6.1+
    Reported-by: Norman Maurer <norman_maurer@apple.com>
    Reported-by: Christian Mazakas <christian.mazakas@gmail.com>
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ipv4: Drop tos parameter from flowi4_update_output() [+ + +]

Author: Guillaume Nault <gnault@redhat.com>
Date:   Thu Jun 1 18:37:46 2023 +0200

    ipv4: Drop tos parameter from flowi4_update_output()
    
    [ Upstream commit 3f06760c00f56c5fe6c7f3361c2cf64becee1174 ]
    
    Callers of flowi4_update_output() never try to update ->flowi4_tos:
    
      * ip_route_connect() updates ->flowi4_tos with its own current
        value.
    
      * ip_route_newports() has two users: tcp_v4_connect() and
        dccp_v4_connect. Both initialise fl4 with ip_route_connect(), which
        in turn sets ->flowi4_tos with RT_TOS(inet_sk(sk)->tos) and
        ->flowi4_scope based on SOCK_LOCALROUTE.
    
        Then ip_route_newports() updates ->flowi4_tos with
        RT_CONN_FLAGS(sk), which is the same as RT_TOS(inet_sk(sk)->tos),
        unless SOCK_LOCALROUTE is set on the socket. In that case, the
        lowest order bit is set to 1, to eventually inform
        ip_route_output_key_hash() to restrict the scope to RT_SCOPE_LINK.
        This is equivalent to properly setting ->flowi4_scope as
        ip_route_connect() did.
    
      * ip_vs_xmit.c initialises ->flowi4_tos with memset(0), then calls
        flowi4_update_output() with tos=0.
    
      * sctp_v4_get_dst() uses the same RT_CONN_FLAGS_TOS() when
        initialising ->flowi4_tos and when calling flowi4_update_output().
    
    In the end, ->flowi4_tos never changes. So let's just drop the tos
    parameter. This will simplify the conversion of ->flowi4_tos from __u8
    to dscp_t.
    
    Signed-off-by: Guillaume Nault <gnault@redhat.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Stable-dep-of: e34090d7214e ("ipvs: fix uninit-value for saddr in do_output_route4")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ipvs: fix uninit-value for saddr in do_output_route4 [+ + +]

Author: Julian Anastasov <ja@ssi.bg>
Date:   Sat May 3 01:01:18 2025 +0300

    ipvs: fix uninit-value for saddr in do_output_route4
    
    [ Upstream commit e34090d7214e0516eb8722aee295cb2507317c07 ]
    
    syzbot reports for uninit-value for the saddr argument [1].
    commit 4754957f04f5 ("ipvs: do not use random local source address for
    tunnels") already implies that the input value of saddr
    should be ignored but the code is still reading it which can prevent
    to connect the route. Fix it by changing the argument to ret_saddr.
    
    [1]
    BUG: KMSAN: uninit-value in do_output_route4+0x42c/0x4d0 net/netfilter/ipvs/ip_vs_xmit.c:147
     do_output_route4+0x42c/0x4d0 net/netfilter/ipvs/ip_vs_xmit.c:147
     __ip_vs_get_out_rt+0x403/0x21d0 net/netfilter/ipvs/ip_vs_xmit.c:330
     ip_vs_tunnel_xmit+0x205/0x2380 net/netfilter/ipvs/ip_vs_xmit.c:1136
     ip_vs_in_hook+0x1aa5/0x35b0 net/netfilter/ipvs/ip_vs_core.c:2063
     nf_hook_entry_hookfn include/linux/netfilter.h:154 [inline]
     nf_hook_slow+0xf7/0x400 net/netfilter/core.c:626
     nf_hook include/linux/netfilter.h:269 [inline]
     __ip_local_out+0x758/0x7e0 net/ipv4/ip_output.c:118
     ip_local_out net/ipv4/ip_output.c:127 [inline]
     ip_send_skb+0x6a/0x3c0 net/ipv4/ip_output.c:1501
     udp_send_skb+0xfda/0x1b70 net/ipv4/udp.c:1195
     udp_sendmsg+0x2fe3/0x33c0 net/ipv4/udp.c:1483
     inet_sendmsg+0x1fc/0x280 net/ipv4/af_inet.c:851
     sock_sendmsg_nosec net/socket.c:712 [inline]
     __sock_sendmsg+0x267/0x380 net/socket.c:727
     ____sys_sendmsg+0x91b/0xda0 net/socket.c:2566
     ___sys_sendmsg+0x28d/0x3c0 net/socket.c:2620
     __sys_sendmmsg+0x41d/0x880 net/socket.c:2702
     __compat_sys_sendmmsg net/compat.c:360 [inline]
     __do_compat_sys_sendmmsg net/compat.c:367 [inline]
     __se_compat_sys_sendmmsg net/compat.c:364 [inline]
     __ia32_compat_sys_sendmmsg+0xc8/0x140 net/compat.c:364
     ia32_sys_call+0x3ffa/0x41f0 arch/x86/include/generated/asm/syscalls_32.h:346
     do_syscall_32_irqs_on arch/x86/entry/syscall_32.c:83 [inline]
     __do_fast_syscall_32+0xb0/0x110 arch/x86/entry/syscall_32.c:306
     do_fast_syscall_32+0x38/0x80 arch/x86/entry/syscall_32.c:331
     do_SYSENTER_32+0x1f/0x30 arch/x86/entry/syscall_32.c:369
     entry_SYSENTER_compat_after_hwframe+0x84/0x8e
    
    Uninit was created at:
     slab_post_alloc_hook mm/slub.c:4167 [inline]
     slab_alloc_node mm/slub.c:4210 [inline]
     __kmalloc_cache_noprof+0x8fa/0xe00 mm/slub.c:4367
     kmalloc_noprof include/linux/slab.h:905 [inline]
     ip_vs_dest_dst_alloc net/netfilter/ipvs/ip_vs_xmit.c:61 [inline]
     __ip_vs_get_out_rt+0x35d/0x21d0 net/netfilter/ipvs/ip_vs_xmit.c:323
     ip_vs_tunnel_xmit+0x205/0x2380 net/netfilter/ipvs/ip_vs_xmit.c:1136
     ip_vs_in_hook+0x1aa5/0x35b0 net/netfilter/ipvs/ip_vs_core.c:2063
     nf_hook_entry_hookfn include/linux/netfilter.h:154 [inline]
     nf_hook_slow+0xf7/0x400 net/netfilter/core.c:626
     nf_hook include/linux/netfilter.h:269 [inline]
     __ip_local_out+0x758/0x7e0 net/ipv4/ip_output.c:118
     ip_local_out net/ipv4/ip_output.c:127 [inline]
     ip_send_skb+0x6a/0x3c0 net/ipv4/ip_output.c:1501
     udp_send_skb+0xfda/0x1b70 net/ipv4/udp.c:1195
     udp_sendmsg+0x2fe3/0x33c0 net/ipv4/udp.c:1483
     inet_sendmsg+0x1fc/0x280 net/ipv4/af_inet.c:851
     sock_sendmsg_nosec net/socket.c:712 [inline]
     __sock_sendmsg+0x267/0x380 net/socket.c:727
     ____sys_sendmsg+0x91b/0xda0 net/socket.c:2566
     ___sys_sendmsg+0x28d/0x3c0 net/socket.c:2620
     __sys_sendmmsg+0x41d/0x880 net/socket.c:2702
     __compat_sys_sendmmsg net/compat.c:360 [inline]
     __do_compat_sys_sendmmsg net/compat.c:367 [inline]
     __se_compat_sys_sendmmsg net/compat.c:364 [inline]
     __ia32_compat_sys_sendmmsg+0xc8/0x140 net/compat.c:364
     ia32_sys_call+0x3ffa/0x41f0 arch/x86/include/generated/asm/syscalls_32.h:346
     do_syscall_32_irqs_on arch/x86/entry/syscall_32.c:83 [inline]
     __do_fast_syscall_32+0xb0/0x110 arch/x86/entry/syscall_32.c:306
     do_fast_syscall_32+0x38/0x80 arch/x86/entry/syscall_32.c:331
     do_SYSENTER_32+0x1f/0x30 arch/x86/entry/syscall_32.c:369
     entry_SYSENTER_compat_after_hwframe+0x84/0x8e
    
    CPU: 0 UID: 0 PID: 22408 Comm: syz.4.5165 Not tainted 6.15.0-rc3-syzkaller-00019-gbc3372351d0c #0 PREEMPT(undef)
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 02/12/2025
    
    Reported-by: syzbot+04b9a82855c8aed20860@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/netdev/68138dfa.050a0220.14dd7d.0017.GAE@google.com/
    Fixes: 4754957f04f5 ("ipvs: do not use random local source address for tunnels")
    Signed-off-by: Julian Anastasov <ja@ssi.bg>
    Acked-by: Simon Horman <horms@kernel.org>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ksmbd: fix memory leak in parse_lease_state() [+ + +]

Author: Wang Zhaolong <wangzhaolong1@huawei.com>
Date:   Wed Apr 30 11:16:23 2025 +0800

    ksmbd: fix memory leak in parse_lease_state()
    
    [ Upstream commit eb4447bcce915b43b691123118893fca4f372a8f ]
    
    The previous patch that added bounds check for create lease context
    introduced a memory leak. When the bounds check fails, the function
    returns NULL without freeing the previously allocated lease_ctx_info
    structure.
    
    This patch fixes the issue by adding kfree(lreq) before returning NULL
    in both boundary check cases.
    
    Fixes: bab703ed8472 ("ksmbd: add bounds check for create lease context")
    Signed-off-by: Wang Zhaolong <wangzhaolong1@huawei.com>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ksmbd: prevent out-of-bounds stream writes by validating *pos [+ + +]

Author: Norbert Szetei <norbert@doyensec.com>
Date:   Fri May 2 08:21:58 2025 +0900

    ksmbd: prevent out-of-bounds stream writes by validating *pos
    
    commit 0ca6df4f40cf4c32487944aaf48319cb6c25accc upstream.
    
    ksmbd_vfs_stream_write() did not validate whether the write offset
    (*pos) was within the bounds of the existing stream data length (v_len).
    If *pos was greater than or equal to v_len, this could lead to an
    out-of-bounds memory write.
    
    This patch adds a check to ensure *pos is less than v_len before
    proceeding. If the condition fails, -EINVAL is returned.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Norbert Szetei <norbert@doyensec.com>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Linux: Linux 6.1.139 [+ + +]

Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Sun May 18 08:21:27 2025 +0200

    Linux 6.1.139
    
    Link: https://lore.kernel.org/r/20250512172023.126467649@linuxfoundation.org
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Tested-by: Salvatore Bonaccorso <carnil@debian.org>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Tested-by: Mark Brown <broonie@kernel.org>
    Tested-by: Ron Economos <re@w6rz.net>
    Tested-by: Peter Schneider <pschneider1968@googlemail.com>
    Tested-by: Linux Kernel Functional Testing <lkft@linaro.org>
    Tested-by: Shuah Khan <skhan@linuxfoundation.org>
    Tested-by: Hardik Garg <hargar@linux.microsoft.com>
    Link: https://lore.kernel.org/r/20250514125614.705014741@linuxfoundation.org
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Tested-by: Mark Brown <broonie@kernel.org>
    Tested-by: Linux Kernel Functional Testing <lkft@linaro.org>
    Tested-by: Peter Schneider <pschneider1968@googlemail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

MIPS: Fix MAX_REG_OFFSET [+ + +]

Author: Thorsten Blum <thorsten.blum@linux.dev>
Date:   Sun Apr 27 13:34:24 2025 +0200

    MIPS: Fix MAX_REG_OFFSET
    
    [ Upstream commit c44572e0cc13c9afff83fd333135a0aa9b27ba26 ]
    
    Fix MAX_REG_OFFSET to point to the last register in 'pt_regs' and not to
    the marker itself, which could allow regs_get_register() to return an
    invalid offset.
    
    Fixes: 40e084a506eb ("MIPS: Add uprobes support.")
    Suggested-by: Maciej W. Rozycki <macro@orcam.me.uk>
    Signed-off-by: Thorsten Blum <thorsten.blum@linux.dev>
    Signed-off-by: Thomas Bogendoerfer <tsbogend@alpha.franken.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

module: ensure that kobject_put() is safe for module type kobjects [+ + +]

Author: Dmitry Antipov <dmantipov@yandex.ru>
Date:   Wed May 7 09:50:44 2025 +0300

    module: ensure that kobject_put() is safe for module type kobjects
    
    commit a6aeb739974ec73e5217c75a7c008a688d3d5cf1 upstream.
    
    In 'lookup_or_create_module_kobject()', an internal kobject is created
    using 'module_ktype'. So call to 'kobject_put()' on error handling
    path causes an attempt to use an uninitialized completion pointer in
    'module_kobject_release()'. In this scenario, we just want to release
    kobject without an extra synchronization required for a regular module
    unloading process, so adding an extra check whether 'complete()' is
    actually required makes 'kobject_put()' safe.
    
    Reported-by: syzbot+7fb8a372e1f6add936dd@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=7fb8a372e1f6add936dd
    Fixes: 942e443127e9 ("module: Fix mod->mkobj.kobj potentially freed too early")
    Cc: stable@vger.kernel.org
    Suggested-by: Petr Pavlu <petr.pavlu@suse.com>
    Signed-off-by: Dmitry Antipov <dmantipov@yandex.ru>
    Link: https://lore.kernel.org/r/20250507065044.86529-1-dmantipov@yandex.ru
    Signed-off-by: Petr Pavlu <petr.pavlu@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: dsa: b53: allow leaky reserved multicast [+ + +]

Author: Jonas Gorski <jonas.gorski@gmail.com>
Date:   Tue Apr 29 22:17:00 2025 +0200

    net: dsa: b53: allow leaky reserved multicast
    
    [ Upstream commit 5f93185a757ff38b36f849c659aeef368db15a68 ]
    
    Allow reserved multicast to ignore VLAN membership so STP and other
    management protocols work without a PVID VLAN configured when using a
    vlan aware bridge.
    
    Fixes: 967dd82ffc52 ("net: dsa: b53: Add support for Broadcom RoboSwitch")
    Signed-off-by: Jonas Gorski <jonas.gorski@gmail.com>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20250429201710.330937-2-jonas.gorski@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: always rejoin default untagged VLAN on bridge leave [+ + +]

Author: Jonas Gorski <jonas.gorski@gmail.com>
Date:   Tue Apr 29 22:17:05 2025 +0200

    net: dsa: b53: always rejoin default untagged VLAN on bridge leave
    
    [ Upstream commit 13b152ae40495966501697693f048f47430c50fd ]
    
    While JOIN_ALL_VLAN allows to join all VLANs, we still need to keep the
    default VLAN enabled so that untagged traffic stays untagged.
    
    So rejoin the default VLAN even for switches with JOIN_ALL_VLAN support.
    
    Fixes: 48aea33a77ab ("net: dsa: b53: Add JOIN_ALL_VLAN support")
    Signed-off-by: Jonas Gorski <jonas.gorski@gmail.com>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20250429201710.330937-7-jonas.gorski@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: fix clearing PVID of a port [+ + +]

Author: Jonas Gorski <jonas.gorski@gmail.com>
Date:   Tue Apr 29 22:17:02 2025 +0200

    net: dsa: b53: fix clearing PVID of a port
    
    [ Upstream commit f480851981043d9bb6447ca9883ade9247b9a0ad ]
    
    Currently the PVID of ports are only set when adding/updating VLANs with
    PVID set or removing VLANs, but not when clearing the PVID flag of a
    VLAN.
    
    E.g. the following flow
    
    $ ip link add br0 type bridge vlan_filtering 1
    $ ip link set sw1p1 master bridge
    $ bridge vlan add dev sw1p1 vid 10 pvid untagged
    $ bridge vlan add dev sw1p1 vid 10 untagged
    
    Would keep the PVID set as 10, despite the flag being cleared. Fix this
    by checking if we need to unset the PVID on vlan updates.
    
    Fixes: a2482d2ce349 ("net: dsa: b53: Plug in VLAN support")
    Signed-off-by: Jonas Gorski <jonas.gorski@gmail.com>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20250429201710.330937-4-jonas.gorski@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: fix flushing old pvid VLAN on pvid change [+ + +]

Author: Jonas Gorski <jonas.gorski@gmail.com>
Date:   Tue Apr 29 22:17:03 2025 +0200

    net: dsa: b53: fix flushing old pvid VLAN on pvid change
    
    [ Upstream commit 083c6b28c0cbcd83b6af1a10f2c82937129b3438 ]
    
    Presumably the intention here was to flush the VLAN of the old pvid, not
    the added VLAN again, which we already flushed before.
    
    Fixes: a2482d2ce349 ("net: dsa: b53: Plug in VLAN support")
    Signed-off-by: Jonas Gorski <jonas.gorski@gmail.com>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20250429201710.330937-5-jonas.gorski@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: fix learning on VLAN unaware bridges [+ + +]

Author: Jonas Gorski <jonas.gorski@gmail.com>
Date:   Tue Apr 29 22:17:09 2025 +0200

    net: dsa: b53: fix learning on VLAN unaware bridges
    
    [ Upstream commit 9f34ad89bcf0e6df6f8b01f1bdab211493fc66d1 ]
    
    When VLAN filtering is off, we configure the switch to forward, but not
    learn on VLAN table misses. This effectively disables learning while not
    filtering.
    
    Fix this by switching to forward and learn. Setting the learning disable
    register will still control whether learning actually happens.
    
    Fixes: dad8d7c6452b ("net: dsa: b53: Properly account for VLAN filtering")
    Signed-off-by: Jonas Gorski <jonas.gorski@gmail.com>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20250429201710.330937-11-jonas.gorski@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: fix VLAN ID for untagged vlan on bridge leave [+ + +]

Author: Jonas Gorski <jonas.gorski@gmail.com>
Date:   Tue Apr 29 22:17:04 2025 +0200

    net: dsa: b53: fix VLAN ID for untagged vlan on bridge leave
    
    [ Upstream commit a1c1901c5cc881425cc45992ab6c5418174e9e5a ]
    
    The untagged default VLAN is added to the default vlan, which may be
    one, but we modify the VLAN 0 entry on bridge leave.
    
    Fix this to use the correct VLAN entry for the default pvid.
    
    Fixes: fea83353177a ("net: dsa: b53: Fix default VLAN ID")
    Signed-off-by: Jonas Gorski <jonas.gorski@gmail.com>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20250429201710.330937-6-jonas.gorski@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: ipset: fix region locking in hash types [+ + +]

Author: Jozsef Kadlecsik <kadlec@netfilter.org>
Date:   Wed May 7 17:01:59 2025 +0200

    netfilter: ipset: fix region locking in hash types
    
    [ Upstream commit 8478a729c0462273188263136880480729e9efca ]
    
    Region locking introduced in v5.6-rc4 contained three macros to handle
    the region locks: ahash_bucket_start(), ahash_bucket_end() which gave
    back the start and end hash bucket values belonging to a given region
    lock and ahash_region() which should give back the region lock belonging
    to a given hash bucket. The latter was incorrect which can lead to a
    race condition between the garbage collector and adding new elements
    when a hash type of set is defined with timeouts.
    
    Fixes: f66ee0410b1c ("netfilter: ipset: Fix "INFO: rcu detected stall in hash_xxx" reports")
    Reported-by: Kota Toda <kota.toda@gmo-cybersecurity.com>
    Signed-off-by: Jozsef Kadlecsik <kadlec@netfilter.org>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nvme: unblock ctrl state transition for firmware update [+ + +]

Author: Daniel Wagner <wagi@kernel.org>
Date:   Fri May 2 10:58:00 2025 +0200

    nvme: unblock ctrl state transition for firmware update
    
    [ Upstream commit 650415fca0a97472fdd79725e35152614d1aad76 ]
    
    The original nvme subsystem design didn't have a CONNECTING state; the
    state machine allowed transitions from RESETTING to LIVE directly.
    
    With the introduction of nvme fabrics the CONNECTING state was
    introduce. Over time the nvme-pci started to use the CONNECTING state as
    well.
    
    Eventually, a bug fix for the nvme-fc started to depend that the only
    valid transition to LIVE was from CONNECTING. Though this change didn't
    update the firmware update handler which was still depending on
    RESETTING to LIVE transition.
    
    The simplest way to address it for the time being is to switch into
    CONNECTING state before going to LIVE state.
    
    Fixes: d2fe192348f9 ("nvme: only allow entering LIVE from CONNECTING state")
    Reported-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Daniel Wagner <wagi@kernel.org>
    Closes: https://lore.kernel.org/all/0134ea15-8d5f-41f7-9e9a-d7e6d82accaa@roeck-us.net
    Reviewed-by: Keith Busch <kbusch@kernel.org>
    Reviewed-by: Sagi Grimberg <sagi@grimberg.me>
    Reviewed-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ocfs2: implement handshaking with ocfs2 recovery thread [+ + +]

Author: Jan Kara <jack@suse.cz>
Date:   Thu Apr 24 15:45:12 2025 +0200

    ocfs2: implement handshaking with ocfs2 recovery thread
    
    commit 8f947e0fd595951460f5a6e1ac29baa82fa02eab upstream.
    
    We will need ocfs2 recovery thread to acknowledge transitions of
    recovery_state when disabling particular types of recovery.  This is
    similar to what currently happens when disabling recovery completely, just
    more general.  Implement the handshake and use it for exit from recovery.
    
    Link: https://lkml.kernel.org/r/20250424134515.18933-5-jack@suse.cz
    Fixes: 5f530de63cfc ("ocfs2: Use s_umount for quota recovery protection")
    Signed-off-by: Jan Kara <jack@suse.cz>
    Reviewed-by: Heming Zhao <heming.zhao@suse.com>
    Tested-by: Heming Zhao <heming.zhao@suse.com>
    Acked-by: Joseph Qi <joseph.qi@linux.alibaba.com>
    Cc: Changwei Ge <gechangwei@live.cn>
    Cc: Joel Becker <jlbec@evilplan.org>
    Cc: Jun Piao <piaojun@huawei.com>
    Cc: Junxiao Bi <junxiao.bi@oracle.com>
    Cc: Mark Fasheh <mark@fasheh.com>
    Cc: Murad Masimov <m.masimov@mt-integration.ru>
    Cc: Shichangkuo <shi.changkuo@h3c.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ocfs2: stop quota recovery before disabling quotas [+ + +]

Author: Jan Kara <jack@suse.cz>
Date:   Thu Apr 24 15:45:13 2025 +0200

    ocfs2: stop quota recovery before disabling quotas
    
    commit fcaf3b2683b05a9684acdebda706a12025a6927a upstream.
    
    Currently quota recovery is synchronized with unmount using sb->s_umount
    semaphore.  That is however prone to deadlocks because
    flush_workqueue(osb->ocfs2_wq) called from umount code can wait for quota
    recovery to complete while ocfs2_finish_quota_recovery() waits for
    sb->s_umount semaphore.
    
    Grabbing of sb->s_umount semaphore in ocfs2_finish_quota_recovery() is
    only needed to protect that function from disabling of quotas from
    ocfs2_dismount_volume().  Handle this problem by disabling quota recovery
    early during unmount in ocfs2_dismount_volume() instead so that we can
    drop acquisition of sb->s_umount from ocfs2_finish_quota_recovery().
    
    Link: https://lkml.kernel.org/r/20250424134515.18933-6-jack@suse.cz
    Fixes: 5f530de63cfc ("ocfs2: Use s_umount for quota recovery protection")
    Signed-off-by: Jan Kara <jack@suse.cz>
    Reported-by: Shichangkuo <shi.changkuo@h3c.com>
    Reported-by: Murad Masimov <m.masimov@mt-integration.ru>
    Reviewed-by: Heming Zhao <heming.zhao@suse.com>
    Tested-by: Heming Zhao <heming.zhao@suse.com>
    Acked-by: Joseph Qi <joseph.qi@linux.alibaba.com>
    Cc: Changwei Ge <gechangwei@live.cn>
    Cc: Joel Becker <jlbec@evilplan.org>
    Cc: Jun Piao <piaojun@huawei.com>
    Cc: Junxiao Bi <junxiao.bi@oracle.com>
    Cc: Mark Fasheh <mark@fasheh.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ocfs2: switch osb->disable_recovery to enum [+ + +]

Author: Jan Kara <jack@suse.cz>
Date:   Thu Apr 24 15:45:11 2025 +0200

    ocfs2: switch osb->disable_recovery to enum
    
    commit c0fb83088f0cc4ee4706e0495ee8b06f49daa716 upstream.
    
    Patch series "ocfs2: Fix deadlocks in quota recovery", v3.
    
    This implements another approach to fixing quota recovery deadlocks.  We
    avoid grabbing sb->s_umount semaphore from ocfs2_finish_quota_recovery()
    and instead stop quota recovery early in ocfs2_dismount_volume().
    
    
    This patch (of 3):
    
    We will need more recovery states than just pure enable / disable to fix
    deadlocks with quota recovery.  Switch osb->disable_recovery to enum.
    
    Link: https://lkml.kernel.org/r/20250424134301.1392-1-jack@suse.cz
    Link: https://lkml.kernel.org/r/20250424134515.18933-4-jack@suse.cz
    Fixes: 5f530de63cfc ("ocfs2: Use s_umount for quota recovery protection")
    Signed-off-by: Jan Kara <jack@suse.cz>
    Reviewed-by: Heming Zhao <heming.zhao@suse.com>
    Tested-by: Heming Zhao <heming.zhao@suse.com>
    Acked-by: Joseph Qi <joseph.qi@linux.alibaba.com>
    Cc: Mark Fasheh <mark@fasheh.com>
    Cc: Joel Becker <jlbec@evilplan.org>
    Cc: Junxiao Bi <junxiao.bi@oracle.com>
    Cc: Changwei Ge <gechangwei@live.cn>
    Cc: Jun Piao <piaojun@huawei.com>
    Cc: Murad Masimov <m.masimov@mt-integration.ru>
    Cc: Shichangkuo <shi.changkuo@h3c.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

openvswitch: Fix unsafe attribute parsing in output_userspace() [+ + +]

Author: Eelco Chaudron <echaudro@redhat.com>
Date:   Tue May 6 16:28:54 2025 +0200

    openvswitch: Fix unsafe attribute parsing in output_userspace()
    
    commit 6beb6835c1fbb3f676aebb51a5fee6b77fed9308 upstream.
    
    This patch replaces the manual Netlink attribute iteration in
    output_userspace() with nla_for_each_nested(), which ensures that only
    well-formed attributes are processed.
    
    Fixes: ccb1352e76cf ("net: Add Open vSwitch kernel components.")
    Signed-off-by: Eelco Chaudron <echaudro@redhat.com>
    Acked-by: Ilya Maximets <i.maximets@ovn.org>
    Acked-by: Aaron Conole <aconole@redhat.com>
    Link: https://patch.msgid.link/0bd65949df61591d9171c0dc13e42cea8941da10.1746541734.git.echaudro@redhat.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

rcu/kvfree: Add kvfree_rcu_mightsleep() and kfree_rcu_mightsleep() [+ + +]

Author: Uladzislau Rezki (Sony) <urezki@gmail.com>
Date:   Wed Feb 1 16:08:07 2023 +0100

    rcu/kvfree: Add kvfree_rcu_mightsleep() and kfree_rcu_mightsleep()
    
    [ Upstream commit 608723c41cd951fb32ade2f8371e61c270816175 ]
    
    The kvfree_rcu() and kfree_rcu() APIs are hazardous in that if you forget
    the second argument, it works, but might sleep.  This sleeping can be a
    correctness bug from atomic contexts, and even in non-atomic contexts
    it might introduce unacceptable latencies.  This commit therefore adds
    kvfree_rcu_mightsleep() and kfree_rcu_mightsleep(), which will replace
    the single-argument kvfree_rcu() and kfree_rcu(), respectively.
    
    This commit enables a series of commits that switch from single-argument
    kvfree_rcu() and kfree_rcu() to their _mightsleep() counterparts.  Once
    all of these commits land, the single-argument versions will be removed.
    
    Signed-off-by: Uladzislau Rezki (Sony) <urezki@gmail.com>
    Signed-off-by: Paul E. McKenney <paulmck@kernel.org>
    Stable-dep-of: 511e64e13d8c ("can: gw: fix RCU/BH usage in cgw_create_job()")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Revert "net: phy: microchip: force IRQ polling mode for lan88xx" [+ + +]

Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Mon May 12 16:01:41 2025 +0200

    Revert "net: phy: microchip: force IRQ polling mode for lan88xx"
    
    This reverts commit 9b89102fbb8fc5393e2a0f981aafdb3cf43591ee which is
    commit 30a41ed32d3088cd0d682a13d7f30b23baed7e93 upstream.
    
    It is reported to cause NFS boot problems on a Raspberry Pi 3b so revert
    it from this branch for now.
    
    Cc: Fiona Klute <fiona.klute@gmx.de>
    Cc: Andrew Lunn <andrew@lunn.ch>
    Cc: Paolo Abeni <pabeni@redhat.com>
    Cc: Sasha Levin <sashal@kernel.org>
    Link: https://lore.kernel.org/r/aB6uurX99AZWM9I1@finisterre.sirena.org.uk
    Reported-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

sch_htb: make htb_deactivate() idempotent [+ + +]

Author: Cong Wang <xiyou.wangcong@gmail.com>
Date:   Mon Apr 28 16:29:54 2025 -0700

    sch_htb: make htb_deactivate() idempotent
    
    [ Upstream commit 3769478610135e82b262640252d90f6efb05be71 ]
    
    Alan reported a NULL pointer dereference in htb_next_rb_node()
    after we made htb_qlen_notify() idempotent.
    
    It turns out in the following case it introduced some regression:
    
    htb_dequeue_tree():
      |-> fq_codel_dequeue()
        |-> qdisc_tree_reduce_backlog()
          |-> htb_qlen_notify()
            |-> htb_deactivate()
      |-> htb_next_rb_node()
      |-> htb_deactivate()
    
    For htb_next_rb_node(), after calling the 1st htb_deactivate(), the
    clprio[prio]->ptr could be already set to  NULL, which means
    htb_next_rb_node() is vulnerable here.
    
    For htb_deactivate(), although we checked qlen before calling it, in
    case of qlen==0 after qdisc_tree_reduce_backlog(), we may call it again
    which triggers the warning inside.
    
    To fix the issues here, we need to:
    
    1) Make htb_deactivate() idempotent, that is, simply return if we
       already call it before.
    2) Make htb_next_rb_node() safe against ptr==NULL.
    
    Many thanks to Alan for testing and for the reproducer.
    
    Fixes: 5ba8b837b522 ("sch_htb: make htb_qlen_notify() idempotent")
    Reported-by: Alan J. Wylie <alan@wylie.me.uk>
    Signed-off-by: Cong Wang <xiyou.wangcong@gmail.com>
    Link: https://patch.msgid.link/20250428232955.1740419-2-xiyou.wangcong@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

staging: axis-fifo: Correct handling of tx_fifo_depth for size validation [+ + +]

Author: Gabriel Shahrouzi <gshahrouzi@gmail.com>
Date:   Fri Apr 18 21:29:37 2025 -0400

    staging: axis-fifo: Correct handling of tx_fifo_depth for size validation
    
    commit 2ca34b508774aaa590fc3698a54204706ecca4ba upstream.
    
    Remove erroneous subtraction of 4 from the total FIFO depth read from
    device tree. The stored depth is for checking against total capacity,
    not initial vacancy. This prevented writes near the FIFO's full size.
    
    The check performed just before data transfer, which uses live reads of
    the TDFV register to determine current vacancy, correctly handles the
    initial Depth - 4 hardware state and subsequent FIFO fullness.
    
    Fixes: 4a965c5f89de ("staging: add driver for Xilinx AXI-Stream FIFO v4.1 IP core")
    Cc: stable@vger.kernel.org
    Signed-off-by: Gabriel Shahrouzi <gshahrouzi@gmail.com>
    Link: https://lore.kernel.org/r/20250419012937.674924-1-gshahrouzi@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

staging: axis-fifo: Remove hardware resets for user errors [+ + +]

Author: Gabriel Shahrouzi <gshahrouzi@gmail.com>
Date:   Fri Apr 18 20:43:06 2025 -0400

    staging: axis-fifo: Remove hardware resets for user errors
    
    commit c6e8d85fafa7193613db37da29c0e8d6e2515b13 upstream.
    
    The axis-fifo driver performs a full hardware reset (via
    reset_ip_core()) in several error paths within the read and write
    functions. This reset flushes both TX and RX FIFOs and resets the
    AXI-Stream links.
    
    Allow the user to handle the error without causing hardware disruption
    or data loss in other FIFO paths.
    
    Fixes: 4a965c5f89de ("staging: add driver for Xilinx AXI-Stream FIFO v4.1 IP core")
    Cc: stable@vger.kernel.org
    Signed-off-by: Gabriel Shahrouzi <gshahrouzi@gmail.com>
    Link: https://lore.kernel.org/r/20250419004306.669605-1-gshahrouzi@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

staging: iio: adc: ad7816: Correct conditional logic for store mode [+ + +]

Author: Gabriel Shahrouzi <gshahrouzi@gmail.com>
Date:   Mon Apr 14 11:40:49 2025 -0400

    staging: iio: adc: ad7816: Correct conditional logic for store mode
    
    commit 2e922956277187655ed9bedf7b5c28906e51708f upstream.
    
    The mode setting logic in ad7816_store_mode was reversed due to
    incorrect handling of the strcmp return value. strcmp returns 0 on
    match, so the `if (strcmp(buf, "full"))` block executed when the
    input was not "full".
    
    This resulted in "full" setting the mode to AD7816_PD (power-down) and
    other inputs setting it to AD7816_FULL.
    
    Fix this by checking it against 0 to correctly check for "full" and
    "power-down", mapping them to AD7816_FULL and AD7816_PD respectively.
    
    Fixes: 7924425db04a ("staging: iio: adc: new driver for AD7816 devices")
    Cc: stable@vger.kernel.org
    Signed-off-by: Gabriel Shahrouzi <gshahrouzi@gmail.com>
    Acked-by: Nuno Sá <nuno.sa@analog.com>
    Link: https://lore.kernel.org/stable/20250414152920.467505-1-gshahrouzi%40gmail.com
    Link: https://patch.msgid.link/20250414154050.469482-1-gshahrouzi@gmail.com
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

types: Complement the aligned types with signed 64-bit one [+ + +]

Author: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
Date:   Tue Sep 3 20:59:04 2024 +0300

    types: Complement the aligned types with signed 64-bit one
    
    [ Upstream commit e4ca0e59c39442546866f3dd514a3a5956577daf ]
    
    Some user may want to use aligned signed 64-bit type.
    Provide it for them.
    
    Signed-off-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/20240903180218.3640501-2-andriy.shevchenko@linux.intel.com
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Stable-dep-of: 1bb942287e05 ("iio: accel: adxl355: Make timestamp 64-bit aligned using aligned_s64")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: cdnsp: Fix issue with resuming from L1 [+ + +]

Author: Pawel Laszczak <pawell@cadence.com>
Date:   Fri Apr 18 04:55:16 2025 +0000

    usb: cdnsp: Fix issue with resuming from L1
    
    commit 241e2ce88e5a494be7a5d44c0697592f1632fbee upstream.
    
    In very rare cases after resuming controller from L1 to L0 it reads
    registers before the clock UTMI have been enabled and as the result
    driver reads incorrect value.
    Most of registers are in APB domain clock but some of them (e.g. PORTSC)
    are in UTMI domain clock.
    After entering to L1 state the UTMI clock can be disabled.
    When controller transition from L1 to L0 the port status change event is
    reported and in interrupt runtime function driver reads PORTSC.
    During this read operation controller synchronize UTMI and APB domain
    but UTMI clock is still disabled and in result it reads 0xFFFFFFFF value.
    To fix this issue driver increases APB timeout value.
    
    The issue is platform specific and if the default value of APB timeout
    is not sufficient then this time should be set Individually for each
    platform.
    
    Fixes: 3d82904559f4 ("usb: cdnsp: cdns3 Add main part of Cadence USBSSP DRD Driver")
    Cc: stable <stable@kernel.org>
    Signed-off-by: Pawel Laszczak <pawell@cadence.com>
    Acked-by: Peter Chen <peter.chen@kernel.org>
    Link: https://lore.kernel.org/r/PH7PR07MB953846C57973E4DB134CAA71DDBF2@PH7PR07MB9538.namprd07.prod.outlook.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: cdnsp: fix L1 resume issue for RTL_REVISION_NEW_LPM version [+ + +]

Author: Pawel Laszczak <pawell@cadence.com>
Date:   Fri Apr 25 05:55:40 2025 +0000

    usb: cdnsp: fix L1 resume issue for RTL_REVISION_NEW_LPM version
    
    commit 8614ecdb1570e4fffe87ebdc62b613ed66f1f6a6 upstream.
    
    The controllers with rtl version larger than
    RTL_REVISION_NEW_LPM (0x00002700) has bug which causes that controller
    doesn't resume from L1 state. It happens if after receiving LPM packet
    controller starts transitioning to L1 and in this moment the driver force
    resuming by write operation to PORTSC.PLS.
    It's corner case and happens when write operation to PORTSC occurs during
    device delay before transitioning to L1 after transmitting ACK
    time (TL1TokenRetry).
    
    Forcing transition from L1->L0 by driver for revision larger than
    RTL_REVISION_NEW_LPM is not needed, so driver can simply fix this issue
    through block call of cdnsp_force_l0_go function.
    
    Fixes: 3d82904559f4 ("usb: cdnsp: cdns3 Add main part of Cadence USBSSP DRD Driver")
    Cc: stable <stable@kernel.org>
    Signed-off-by: Pawel Laszczak <pawell@cadence.com>
    Acked-by: Peter Chen <peter.chen@kernel.org>
    Link: https://lore.kernel.org/r/PH7PR07MB9538B55C3A6E71F9ED29E980DD842@PH7PR07MB9538.namprd07.prod.outlook.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: gadget: tegra-xudc: ACK ST_RC after clearing CTRL_RUN [+ + +]

Author: Wayne Chang <waynec@nvidia.com>
Date:   Fri Apr 18 16:12:28 2025 +0800

    usb: gadget: tegra-xudc: ACK ST_RC after clearing CTRL_RUN
    
    commit 59820fde001500c167342257650541280c622b73 upstream.
    
    We identified a bug where the ST_RC bit in the status register was not
    being acknowledged after clearing the CTRL_RUN bit in the control
    register. This could lead to unexpected behavior in the USB gadget
    drivers.
    
    This patch resolves the issue by adding the necessary code to explicitly
    acknowledge ST_RC after clearing CTRL_RUN based on the programming
    sequence, ensuring proper state transition.
    
    Fixes: 49db427232fe ("usb: gadget: Add UDC driver for tegra XUSB device mode controller")
    Cc: stable <stable@kernel.org>
    Signed-off-by: Wayne Chang <waynec@nvidia.com>
    Link: https://lore.kernel.org/r/20250418081228.1194779-1-waynec@nvidia.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: host: tegra: Prevent host controller crash when OTG port is used [+ + +]

Author: Jim Lin <jilin@nvidia.com>
Date:   Tue Apr 22 19:40:01 2025 +0800

    usb: host: tegra: Prevent host controller crash when OTG port is used
    
    commit 732f35cf8bdfece582f6e4a9c659119036577308 upstream.
    
    When a USB device is connected to the OTG port, the tegra_xhci_id_work()
    routine transitions the PHY to host mode and calls xhci_hub_control()
    with the SetPortFeature command to enable port power.
    
    In certain cases, the XHCI controller may be in a low-power state
    when this operation occurs. If xhci_hub_control() is invoked while
    the controller is suspended, the PORTSC register may return 0xFFFFFFFF,
    indicating a read failure. This causes xhci_hc_died() to be triggered,
    leading to host controller shutdown.
    
    Example backtrace:
    [  105.445736] Workqueue: events tegra_xhci_id_work
    [  105.445747]  dump_backtrace+0x0/0x1e8
    [  105.445759]  xhci_hc_died.part.48+0x40/0x270
    [  105.445769]  tegra_xhci_set_port_power+0xc0/0x240
    [  105.445774]  tegra_xhci_id_work+0x130/0x240
    
    To prevent this, ensure the controller is fully resumed before
    interacting with hardware registers by calling pm_runtime_get_sync()
    prior to the host mode transition and xhci_hub_control().
    
    Fixes: f836e7843036 ("usb: xhci-tegra: Add OTG support")
    Cc: stable <stable@kernel.org>
    Signed-off-by: Jim Lin <jilin@nvidia.com>
    Signed-off-by: Wayne Chang <waynec@nvidia.com>
    Link: https://lore.kernel.org/r/20250422114001.126367-1-waynec@nvidia.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: typec: tcpm: delay SNK_TRY_WAIT_DEBOUNCE to SRC_TRYWAIT transition [+ + +]

Author: RD Babiera <rdbabiera@google.com>
Date:   Tue Apr 29 23:47:01 2025 +0000

    usb: typec: tcpm: delay SNK_TRY_WAIT_DEBOUNCE to SRC_TRYWAIT transition
    
    commit e918d3959b5ae0e793b8f815ce62240e10ba03a4 upstream.
    
    This patch fixes Type-C Compliance Test TD 4.7.6 - Try.SNK DRP Connect
    SNKAS.
    
    The compliance tester moves into SNK_UNATTACHED during toggling and
    expects the PUT to apply Rp after tPDDebounce of detection. If the port
    is in SNK_TRY_WAIT_DEBOUNCE, it will move into SRC_TRYWAIT immediately
    and apply Rp. This violates TD 4.7.5.V.3, where the tester confirms that
    the PUT attaches Rp after the transitions to Unattached.SNK for
    tPDDebounce.
    
    Change the tcpm_set_state delay between SNK_TRY_WAIT_DEBOUNCE and
    SRC_TRYWAIT to tPDDebounce.
    
    Fixes: a0a3e04e6b2c ("staging: typec: tcpm: Check for Rp for tPDDebounce")
    Cc: stable <stable@kernel.org>
    Signed-off-by: RD Babiera <rdbabiera@google.com>
    Reviewed-by: Badhri Jagan Sridharan <badhri@google.com>
    Reviewed-by: Heikki Krogerus <heikki.krogerus@linux.intel.com>
    Link: https://lore.kernel.org/r/20250429234703.3748506-2-rdbabiera@google.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: typec: ucsi: displayport: Fix NULL pointer access [+ + +]

Author: Andrei Kuchynski <akuchynski@chromium.org>
Date:   Thu Apr 24 08:44:29 2025 +0000

    usb: typec: ucsi: displayport: Fix NULL pointer access
    
    commit 312d79669e71283d05c05cc49a1a31e59e3d9e0e upstream.
    
    This patch ensures that the UCSI driver waits for all pending tasks in the
    ucsi_displayport_work workqueue to finish executing before proceeding with
    the partner removal.
    
    Cc: stable <stable@kernel.org>
    Fixes: af8622f6a585 ("usb: typec: ucsi: Support for DisplayPort alt mode")
    Signed-off-by: Andrei Kuchynski <akuchynski@chromium.org>
    Reviewed-by: Heikki Krogerus <heikki.krogerus@linux.intel.com>
    Reviewed-by: Benson Leung <bleung@chromium.org>
    Link: https://lore.kernel.org/r/20250424084429.3220757-3-akuchynski@chromium.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: uhci-platform: Make the clock really optional [+ + +]

Author: Alexey Charkov <alchark@gmail.com>
Date:   Fri Apr 25 18:11:11 2025 +0400

    usb: uhci-platform: Make the clock really optional
    
    commit a5c7973539b010874a37a0e846e62ac6f00553ba upstream.
    
    Device tree bindings state that the clock is optional for UHCI platform
    controllers, and some existing device trees don't provide those - such
    as those for VIA/WonderMedia devices.
    
    The driver however fails to probe now if no clock is provided, because
    devm_clk_get returns an error pointer in such case.
    
    Switch to devm_clk_get_optional instead, so that it could probe again
    on those platforms where no clocks are given.
    
    Cc: stable <stable@kernel.org>
    Fixes: 26c502701c52 ("usb: uhci: Add clk support to uhci-platform")
    Signed-off-by: Alexey Charkov <alchark@gmail.com>
    Link: https://lore.kernel.org/r/20250425-uhci-clock-optional-v1-1-a1d462592f29@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: usbtmc: Fix erroneous generic_read ioctl return [+ + +]

Author: Dave Penkler <dpenkler@gmail.com>
Date:   Fri May 2 09:09:41 2025 +0200

    usb: usbtmc: Fix erroneous generic_read ioctl return
    
    commit 4e77d3ec7c7c0d9535ccf1138827cb9bb5480b9b upstream.
    
    wait_event_interruptible_timeout returns a long
    The return value was being assigned to an int causing an integer overflow
    when the remaining jiffies > INT_MAX which resulted in random error
    returns.
    
    Use a long return value, converting to the int ioctl return only on error.
    
    Fixes: bb99794a4792 ("usb: usbtmc: Add ioctl for vendor specific read")
    Cc: stable@vger.kernel.org
    Signed-off-by: Dave Penkler <dpenkler@gmail.com>
    Link: https://lore.kernel.org/r/20250502070941.31819-4-dpenkler@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: usbtmc: Fix erroneous get_stb ioctl error returns [+ + +]

Author: Dave Penkler <dpenkler@gmail.com>
Date:   Fri May 2 09:09:39 2025 +0200

    usb: usbtmc: Fix erroneous get_stb ioctl error returns
    
    commit cac01bd178d6a2a23727f138d647ce1a0e8a73a1 upstream.
    
    wait_event_interruptible_timeout returns a long
    The return was being assigned to an int causing an integer overflow when
    the remaining jiffies > INT_MAX resulting in random error returns.
    
    Use a long return value and convert to int ioctl return only on error.
    
    When the return value of wait_event_interruptible_timeout was <= INT_MAX
    the number of remaining jiffies was returned which has no meaning for the
    user. Return 0 on success.
    
    Reported-by: Michael Katzmann <vk2bea@gmail.com>
    Fixes: dbf3e7f654c0 ("Implement an ioctl to support the USMTMC-USB488 READ_STATUS_BYTE operation.")
    Cc: stable@vger.kernel.org
    Signed-off-by: Dave Penkler <dpenkler@gmail.com>
    Link: https://lore.kernel.org/r/20250502070941.31819-2-dpenkler@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: usbtmc: Fix erroneous wait_srq ioctl return [+ + +]

Author: Dave Penkler <dpenkler@gmail.com>
Date:   Fri May 2 09:09:40 2025 +0200

    usb: usbtmc: Fix erroneous wait_srq ioctl return
    
    commit a9747c9b8b59ab4207effd20eb91a890acb44e16 upstream.
    
    wait_event_interruptible_timeout returns a long
    The return was being assigned to an int causing an integer overflow when
    the remaining jiffies > INT_MAX resulting in random error returns.
    
    Use a long return value,  converting to the int ioctl return only on
    error.
    
    Fixes: 739240a9f6ac ("usb: usbtmc: Add ioctl USBTMC488_IOCTL_WAIT_SRQ")
    Cc: stable@vger.kernel.org
    Signed-off-by: Dave Penkler <dpenkler@gmail.com>
    Link: https://lore.kernel.org/r/20250502070941.31819-3-dpenkler@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

USB: usbtmc: use interruptible sleep in usbtmc_read [+ + +]

Author: Oliver Neukum <oneukum@suse.com>
Date:   Wed Apr 30 15:48:10 2025 +0200

    USB: usbtmc: use interruptible sleep in usbtmc_read
    
    commit 054c5145540e5ad5b80adf23a5e3e2fc281fb8aa upstream.
    
    usbtmc_read() calls usbtmc_generic_read()
    which uses interruptible sleep, but usbtmc_read()
    itself uses uninterruptble sleep for mutual exclusion
    between threads. That makes no sense.
    Both should use interruptible sleep.
    
    Fixes: 5b775f672cc99 ("USB: add USB test and measurement class driver")
    Cc: stable <stable@kernel.org>
    Signed-off-by: Oliver Neukum <oneukum@suse.com>
    Link: https://lore.kernel.org/r/20250430134810.226015-1-oneukum@suse.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/alternative: Optimize returns patching [+ + +]

Author: Borislav Petkov (AMD) <bp@alien8.de>
Date:   Fri May 12 14:05:11 2023 +0200

    x86/alternative: Optimize returns patching
    
    commit d2408e043e7296017420aa5929b3bba4d5e61013 upstream.
    
    Instead of decoding each instruction in the return sites range only to
    realize that that return site is a jump to the default return thunk
    which is needed - X86_FEATURE_RETHUNK is enabled - lift that check
    before the loop and get rid of that loop overhead.
    
    Add comments about what gets patched, while at it.
    
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Acked-by: Peter Zijlstra (Intel) <peterz@infradead.org>
    Link: https://lore.kernel.org/r/20230512120952.7924-1-bp@alien8.de
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/alternatives: Remove faulty optimization [+ + +]

Author: Josh Poimboeuf <jpoimboe@kernel.org>
Date:   Mon Sep 4 22:04:54 2023 -0700

    x86/alternatives: Remove faulty optimization
    
    commit 4ba89dd6ddeca2a733bdaed7c9a5cbe4e19d9124 upstream.
    
    The following commit
    
      095b8303f383 ("x86/alternative: Make custom return thunk unconditional")
    
    made '__x86_return_thunk' a placeholder value.  All code setting
    X86_FEATURE_RETHUNK also changes the value of 'x86_return_thunk'.  So
    the optimization at the beginning of apply_returns() is dead code.
    
    Also, before the above-mentioned commit, the optimization actually had a
    bug It bypassed __static_call_fixup(), causing some raw returns to
    remain unpatched in static call trampolines.  Thus the 'Fixes' tag.
    
    Fixes: d2408e043e72 ("x86/alternative: Optimize returns patching")
    Signed-off-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Signed-off-by: Ingo Molnar <mingo@kernel.org>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Acked-by: Borislav Petkov (AMD) <bp@alien8.de>
    Link: https://lore.kernel.org/r/16d19d2249d4485d8380fb215ffaae81e6b8119e.1693889988.git.jpoimboe@kernel.org
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/bhi: Do not set BHI_DIS_S in 32-bit mode [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Mon May 5 14:35:12 2025 -0700

    x86/bhi: Do not set BHI_DIS_S in 32-bit mode
    
    commit 073fdbe02c69c43fb7c0d547ec265c7747d4a646 upstream.
    
    With the possibility of intra-mode BHI via cBPF, complete mitigation for
    BHI is to use IBHF (history fence) instruction with BHI_DIS_S set. Since
    this new instruction is only available in 64-bit mode, setting BHI_DIS_S in
    32-bit mode is only a partial mitigation.
    
    Do not set BHI_DIS_S in 32-bit mode so as to avoid reporting misleading
    mitigated status. With this change IBHF won't be used in 32-bit mode, also
    remove the CONFIG_X86_64 check from emit_spectre_bhb_barrier().
    
    Suggested-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/bpf: Add IBHF call at end of classic BPF [+ + +]

Author: Daniel Sneddon <daniel.sneddon@linux.intel.com>
Date:   Mon May 5 14:35:12 2025 -0700

    x86/bpf: Add IBHF call at end of classic BPF
    
    commit 9f725eec8fc0b39bdc07dcc8897283c367c1a163 upstream.
    
    Classic BPF programs can be run by unprivileged users, allowing
    unprivileged code to execute inside the kernel. Attackers can use this to
    craft branch history in kernel mode that can influence the target of
    indirect branches.
    
    BHI_DIS_S provides user-kernel isolation of branch history, but cBPF can be
    used to bypass this protection by crafting branch history in kernel mode.
    To stop intra-mode attacks via cBPF programs, Intel created a new
    instruction Indirect Branch History Fence (IBHF). IBHF prevents the
    predicted targets of subsequent indirect branches from being influenced by
    branch history prior to the IBHF. IBHF is only effective while BHI_DIS_S is
    enabled.
    
    Add the IBHF instruction to cBPF jitted code's exit path. Add the new fence
    when the hardware mitigation is enabled (i.e., X86_FEATURE_CLEAR_BHB_HW is
    set) or after the software sequence (X86_FEATURE_CLEAR_BHB_LOOP) is being
    used in a virtual machine. Note that X86_FEATURE_CLEAR_BHB_HW and
    X86_FEATURE_CLEAR_BHB_LOOP are mutually exclusive, so the JIT compiler will
    only emit the new fence, not the SW sequence, when X86_FEATURE_CLEAR_BHB_HW
    is set.
    
    Hardware that enumerates BHI_NO basically has BHI_DIS_S protections always
    enabled, regardless of the value of BHI_DIS_S. Since BHI_DIS_S doesn't
    protect against intra-mode attacks, enumerate BHI bug on BHI_NO hardware as
    well.
    
    Signed-off-by: Daniel Sneddon <daniel.sneddon@linux.intel.com>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Acked-by: Daniel Borkmann <daniel@iogearbox.net>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/bpf: Call branch history clearing sequence on exit [+ + +]

Author: Daniel Sneddon <daniel.sneddon@linux.intel.com>
Date:   Mon May 5 14:35:12 2025 -0700

    x86/bpf: Call branch history clearing sequence on exit
    
    commit d4e89d212d401672e9cdfe825d947ee3a9fbe3f5 upstream.
    
    Classic BPF programs have been identified as potential vectors for
    intra-mode Branch Target Injection (BTI) attacks. Classic BPF programs can
    be run by unprivileged users. They allow unprivileged code to execute
    inside the kernel. Attackers can use unprivileged cBPF to craft branch
    history in kernel mode that can influence the target of indirect branches.
    
    Introduce a branch history buffer (BHB) clearing sequence during the JIT
    compilation of classic BPF programs. The clearing sequence is the same as
    is used in previous mitigations to protect syscalls. Since eBPF programs
    already have their own mitigations in place, only insert the call on
    classic programs that aren't run by privileged users.
    
    Signed-off-by: Daniel Sneddon <daniel.sneddon@linux.intel.com>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Acked-by: Daniel Borkmann <daniel@iogearbox.net>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/ibt: Keep IBT disabled during alternative patching [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Sat May 3 09:46:31 2025 -0700

    x86/ibt: Keep IBT disabled during alternative patching
    
    commit ebebe30794d38c51f71fe4951ba6af4159d9837d upstream.
    
    cfi_rewrite_callers() updates the fineIBT hash matching at the caller side,
    but except for paranoid-mode it relies on apply_retpoline() and friends for
    any ENDBR relocation. This could temporarily cause an indirect branch to
    land on a poisoned ENDBR.
    
    For instance, with para-virtualization enabled, a simple wrmsrl() could
    have an indirect branch pointing to native_write_msr() who's ENDBR has been
    relocated due to fineIBT:
    
    <wrmsrl>:
           push   %rbp
           mov    %rsp,%rbp
           mov    %esi,%eax
           mov    %rsi,%rdx
           shr    $0x20,%rdx
           mov    %edi,%edi
           mov    %rax,%rsi
           call   *0x21e65d0(%rip)        # <pv_ops+0xb8>
           ^^^^^^^^^^^^^^^^^^^^^^^
    
    Such an indirect call during the alternative patching could #CP if the
    caller is not *yet* adjusted for the new target ENDBR. To prevent a false
     #CP, keep CET-IBT disabled until all callers are patched.
    
    Patching during the module load does not need to be guarded by IBT-disable
    because the module code is not executed until the patching is complete.
    
      [ pawan: Since apply_paravirt() happens before __apply_fineibt()
               relocates the ENDBR, pv_ops in the example above is not relevant.
               It is still safer to keep this commit because missing an ENDBR
               means an oops. ]
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Add "vmexit" option to skip mitigation on some CPUs [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Mon Nov 18 09:53:12 2024 -0800

    x86/its: Add "vmexit" option to skip mitigation on some CPUs
    
    commit 2665281a07e19550944e8354a2024635a7b2714a upstream.
    
    Ice Lake generation CPUs are not affected by guest/host isolation part of
    ITS. If a user is only concerned about KVM guests, they can now choose a
    new cmdline option "vmexit" that will not deploy the ITS mitigation when
    CPU is not affected by guest/host isolation. This saves the performance
    overhead of ITS mitigation on Ice Lake gen CPUs.
    
    When "vmexit" option selected, if the CPU is affected by ITS guest/host
    isolation, the default ITS mitigation is deployed.
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Add support for ITS-safe indirect thunk [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Fri Jun 21 21:17:21 2024 -0700

    x86/its: Add support for ITS-safe indirect thunk
    
    commit 8754e67ad4ac692c67ff1f99c0d07156f04ae40c upstream.
    
    Due to ITS, indirect branches in the lower half of a cacheline may be
    vulnerable to branch target injection attack.
    
    Introduce ITS-safe thunks to patch indirect branches in the lower half of
    cacheline with the thunk. Also thunk any eBPF generated indirect branches
    in emit_indirect_jump().
    
    Below category of indirect branches are not mitigated:
    
    - Indirect branches in the .init section are not mitigated because they are
      discarded after boot.
    - Indirect branches that are explicitly marked retpoline-safe.
    
    Note that retpoline also mitigates the indirect branches against ITS. This
    is because the retpoline sequence fills an RSB entry before RET, and it
    does not suffer from RSB-underflow part of the ITS.
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Add support for ITS-safe return thunk [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Fri Jun 21 21:17:21 2024 -0700

    x86/its: Add support for ITS-safe return thunk
    
    commit a75bf27fe41abe658c53276a0c486c4bf9adecfc upstream.
    
    RETs in the lower half of cacheline may be affected by ITS bug,
    specifically when the RSB-underflows. Use ITS-safe return thunk for such
    RETs.
    
    RETs that are not patched:
    
    - RET in retpoline sequence does not need to be patched, because the
      sequence itself fills an RSB before RET.
    - RET in Call Depth Tracking (CDT) thunks __x86_indirect_{call|jump}_thunk
      and call_depth_return_thunk are not patched because CDT by design
      prevents RSB-underflow.
    - RETs in .init section are not reachable after init.
    - RETs that are explicitly marked safe with ANNOTATE_UNRET_SAFE.
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Align RETs in BHB clear sequence to avoid thunking [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Fri May 2 06:25:19 2025 -0700

    x86/its: Align RETs in BHB clear sequence to avoid thunking
    
    commit f0cd7091cc5a032c8870b4285305d9172569d126 upstream.
    
    The software mitigation for BHI is to execute BHB clear sequence at syscall
    entry, and possibly after a cBPF program. ITS mitigation thunks RETs in the
    lower half of the cacheline. This causes the RETs in the BHB clear sequence
    to be thunked as well, adding unnecessary branches to the BHB clear
    sequence.
    
    Since the sequence is in hot path, align the RET instructions in the
    sequence to avoid thunking.
    
    This is how disassembly clear_bhb_loop() looks like after this change:
    
       0x44 <+4>:     mov    $0x5,%ecx
       0x49 <+9>:     call   0xffffffff81001d9b <clear_bhb_loop+91>
       0x4e <+14>:    jmp    0xffffffff81001de5 <clear_bhb_loop+165>
       0x53 <+19>:    int3
       ...
       0x9b <+91>:    call   0xffffffff81001dce <clear_bhb_loop+142>
       0xa0 <+96>:    ret
       0xa1 <+97>:    int3
       ...
       0xce <+142>:   mov    $0x5,%eax
       0xd3 <+147>:   jmp    0xffffffff81001dd6 <clear_bhb_loop+150>
       0xd5 <+149>:   nop
       0xd6 <+150>:   sub    $0x1,%eax
       0xd9 <+153>:   jne    0xffffffff81001dd3 <clear_bhb_loop+147>
       0xdb <+155>:   sub    $0x1,%ecx
       0xde <+158>:   jne    0xffffffff81001d9b <clear_bhb_loop+91>
       0xe0 <+160>:   ret
       0xe1 <+161>:   int3
       0xe2 <+162>:   int3
       0xe3 <+163>:   int3
       0xe4 <+164>:   int3
       0xe5 <+165>:   lfence
       0xe8 <+168>:   pop    %rbp
       0xe9 <+169>:   ret
    
    Suggested-by: Andrew Cooper <andrew.cooper3@citrix.com>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Enable Indirect Target Selection mitigation [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Fri Jun 21 20:23:23 2024 -0700

    x86/its: Enable Indirect Target Selection mitigation
    
    commit f4818881c47fd91fcb6d62373c57c7844e3de1c0 upstream.
    
    Indirect Target Selection (ITS) is a bug in some pre-ADL Intel CPUs with
    eIBRS. It affects prediction of indirect branch and RETs in the
    lower half of cacheline. Due to ITS such branches may get wrongly predicted
    to a target of (direct or indirect) branch that is located in the upper
    half of the cacheline.
    
    Scope of impact
    ===============
    
    Guest/host isolation
    --------------------
    When eIBRS is used for guest/host isolation, the indirect branches in the
    VMM may still be predicted with targets corresponding to branches in the
    guest.
    
    Intra-mode
    ----------
    cBPF or other native gadgets can be used for intra-mode training and
    disclosure using ITS.
    
    User/kernel isolation
    ---------------------
    When eIBRS is enabled user/kernel isolation is not impacted.
    
    Indirect Branch Prediction Barrier (IBPB)
    -----------------------------------------
    After an IBPB, indirect branches may be predicted with targets
    corresponding to direct branches which were executed prior to IBPB. This is
    mitigated by a microcode update.
    
    Add cmdline parameter indirect_target_selection=off|on|force to control the
    mitigation to relocate the affected branches to an ITS-safe thunk i.e.
    located in the upper half of cacheline. Also add the sysfs reporting.
    
    When retpoline mitigation is deployed, ITS safe-thunks are not needed,
    because retpoline sequence is already ITS-safe. Similarly, when call depth
    tracking (CDT) mitigation is deployed (retbleed=stuff), ITS safe return
    thunk is not used, as CDT prevents RSB-underflow.
    
    To not overcomplicate things, ITS mitigation is not supported with
    spectre-v2 lfence;jmp mitigation. Moreover, it is less practical to deploy
    lfence;jmp mitigation on ITS affected parts anyways.
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Enumerate Indirect Target Selection (ITS) bug [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Fri Jun 21 17:40:41 2024 -0700

    x86/its: Enumerate Indirect Target Selection (ITS) bug
    
    commit 159013a7ca18c271ff64192deb62a689b622d860 upstream.
    
    ITS bug in some pre-Alderlake Intel CPUs may allow indirect branches in the
    first half of a cache line get predicted to a target of a branch located in
    the second half of the cache line.
    
    Set X86_BUG_ITS on affected CPUs. Mitigation to follow in later commits.
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: FineIBT-paranoid vs ITS [+ + +]

Author: Peter Zijlstra <peterz@infradead.org>
Date:   Wed Apr 23 09:57:31 2025 +0200

    x86/its: FineIBT-paranoid vs ITS
    
    commit e52c1dc7455d32c8a55f9949d300e5e87d011fa6 upstream.
    
    FineIBT-paranoid was using the retpoline bytes for the paranoid check,
    disabling retpolines, because all parts that have IBT also have eIBRS
    and thus don't need no stinking retpolines.
    
    Except... ITS needs the retpolines for indirect calls must not be in
    the first half of a cacheline :-/
    
    So what was the paranoid call sequence:
    
      <fineibt_paranoid_start>:
       0:   41 ba 78 56 34 12       mov    $0x12345678, %r10d
       6:   45 3b 53 f7             cmp    -0x9(%r11), %r10d
       a:   4d 8d 5b <f0>           lea    -0x10(%r11), %r11
       e:   75 fd                   jne    d <fineibt_paranoid_start+0xd>
      10:   41 ff d3                call   *%r11
      13:   90                      nop
    
    Now becomes:
    
      <fineibt_paranoid_start>:
       0:   41 ba 78 56 34 12       mov    $0x12345678, %r10d
       6:   45 3b 53 f7             cmp    -0x9(%r11), %r10d
       a:   4d 8d 5b f0             lea    -0x10(%r11), %r11
       e:   2e e8 XX XX XX XX       cs call __x86_indirect_paranoid_thunk_r11
    
      Where the paranoid_thunk looks like:
    
       1d:  <ea>                    (bad)
       __x86_indirect_paranoid_thunk_r11:
       1e:  75 fd                   jne 1d
       __x86_indirect_its_thunk_r11:
       20:  41 ff eb                jmp *%r11
       23:  cc                      int3
    
    [ dhansen: remove initialization to false ]
    
    Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    [ Just a portion of the original commit, in order to fix a build issue
      in stable kernels due to backports ]
    Tested-by: Holger Hoffstätte <holger@applied-asynchrony.com>
    Link: https://lore.kernel.org/r/20250514113952.GB16434@noisy.programming.kicks-ass.net
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Fix build errors when CONFIG_MODULES=n [+ + +]

Author: Eric Biggers <ebiggers@google.com>
Date:   Mon May 12 19:58:39 2025 -0700

    x86/its: Fix build errors when CONFIG_MODULES=n
    
    commit 9f35e33144ae5377d6a8de86dd3bd4d995c6ac65 upstream.
    
    Fix several build errors when CONFIG_MODULES=n, including the following:
    
    ../arch/x86/kernel/alternative.c:195:25: error: incomplete definition of type 'struct module'
      195 |         for (int i = 0; i < mod->its_num_pages; i++) {
    
    Fixes: 872df34d7c51 ("x86/its: Use dynamic thunks for indirect branches")
    Cc: stable@vger.kernel.org
    Signed-off-by: Eric Biggers <ebiggers@google.com>
    Acked-by: Dave Hansen <dave.hansen@intel.com>
    Tested-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
    [ pawan: backport: Bring ITS dynamic thunk code under CONFIG_MODULES ]
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/its: Use dynamic thunks for indirect branches [+ + +]

Author: Peter Zijlstra <peterz@infradead.org>
Date:   Mon Oct 14 10:05:48 2024 -0700

    x86/its: Use dynamic thunks for indirect branches
    
    commit 872df34d7c51a79523820ea6a14860398c639b87 upstream.
    
    ITS mitigation moves the unsafe indirect branches to a safe thunk. This
    could degrade the prediction accuracy as the source address of indirect
    branches becomes same for different execution paths.
    
    To improve the predictions, and hence the performance, assign a separate
    thunk for each indirect callsite. This is also a defense-in-depth measure
    to avoid indirect branches aliasing with each other.
    
    As an example, 5000 dynamic thunks would utilize around 16 bits of the
    address space, thereby gaining entropy. For a BTB that uses
    32 bits for indexing, dynamic thunks could provide better prediction
    accuracy over fixed thunks.
    
    Have ITS thunks be variable sized and use EXECMEM_MODULE_TEXT such that
    they are both more flexible (got to extend them later) and live in 2M TLBs,
    just like kernel code, avoiding undue TLB pressure.
    
      [ pawan: CONFIG_EXECMEM and CONFIG_EXECMEM_ROX are not supported on
               backport kernel, made changes to use module_alloc() and
               set_memory_*() for dynamic thunks. ]
    
    Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/mm: Eliminate window where TLB flushes may be inadvertently skipped [+ + +]

Author: Dave Hansen <dave.hansen@linux.intel.com>
Date:   Thu May 8 15:41:32 2025 -0700

    x86/mm: Eliminate window where TLB flushes may be inadvertently skipped
    
    commit fea4e317f9e7e1f449ce90dedc27a2d2a95bee5a upstream.
    
    tl;dr: There is a window in the mm switching code where the new CR3 is
    set and the CPU should be getting TLB flushes for the new mm.  But
    should_flush_tlb() has a bug and suppresses the flush.  Fix it by
    widening the window where should_flush_tlb() sends an IPI.
    
    Long Version:
    
    === History ===
    
    There were a few things leading up to this.
    
    First, updating mm_cpumask() was observed to be too expensive, so it was
    made lazier.  But being lazy caused too many unnecessary IPIs to CPUs
    due to the now-lazy mm_cpumask().  So code was added to cull
    mm_cpumask() periodically[2].  But that culling was a bit too aggressive
    and skipped sending TLB flushes to CPUs that need them.  So here we are
    again.
    
    === Problem ===
    
    The too-aggressive code in should_flush_tlb() strikes in this window:
    
            // Turn on IPIs for this CPU/mm combination, but only
            // if should_flush_tlb() agrees:
            cpumask_set_cpu(cpu, mm_cpumask(next));
    
            next_tlb_gen = atomic64_read(&next->context.tlb_gen);
            choose_new_asid(next, next_tlb_gen, &new_asid, &need_flush);
            load_new_mm_cr3(need_flush);
            // ^ After 'need_flush' is set to false, IPIs *MUST*
            // be sent to this CPU and not be ignored.
    
            this_cpu_write(cpu_tlbstate.loaded_mm, next);
            // ^ Not until this point does should_flush_tlb()
            // become true!
    
    should_flush_tlb() will suppress TLB flushes between load_new_mm_cr3()
    and writing to 'loaded_mm', which is a window where they should not be
    suppressed.  Whoops.
    
    === Solution ===
    
    Thankfully, the fuzzy "just about to write CR3" window is already marked
    with loaded_mm==LOADED_MM_SWITCHING.  Simply checking for that state in
    should_flush_tlb() is sufficient to ensure that the CPU is targeted with
    an IPI.
    
    This will cause more TLB flush IPIs.  But the window is relatively small
    and I do not expect this to cause any kind of measurable performance
    impact.
    
    Update the comment where LOADED_MM_SWITCHING is written since it grew
    yet another user.
    
    Peter Z also raised a concern that should_flush_tlb() might not observe
    'loaded_mm' and 'is_lazy' in the same order that switch_mm_irqs_off()
    writes them.  Add a barrier to ensure that they are observed in the
    order they are written.
    
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Acked-by: Rik van Riel <riel@surriel.com>
    Link: https://lore.kernel.org/oe-lkp/202411282207.6bd28eae-lkp@intel.com/ [1]
    Fixes: 6db2526c1d69 ("x86/mm/tlb: Only trim the mm_cpumask once a second") [2]
    Reported-by: Stephen Dolan <sdolan@janestreet.com>
    Cc: stable@vger.kernel.org
    Acked-by: Ingo Molnar <mingo@kernel.org>
    Acked-by: Peter Zijlstra (Intel) <peterz@infradead.org>
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/speculation: Add a conditional CS prefix to CALL_NOSPEC [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Fri Feb 28 18:35:58 2025 -0800

    x86/speculation: Add a conditional CS prefix to CALL_NOSPEC
    
    commit 052040e34c08428a5a388b85787e8531970c0c67 upstream.
    
    Retpoline mitigation for spectre-v2 uses thunks for indirect branches. To
    support this mitigation compilers add a CS prefix with
    -mindirect-branch-cs-prefix. For an indirect branch in asm, this needs to
    be added manually.
    
    CS prefix is already being added to indirect branches in asm files, but not
    in inline asm. Add CS prefix to CALL_NOSPEC for inline asm as well. There
    is no JMP_NOSPEC for inline asm.
    
    Reported-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Ingo Molnar <mingo@kernel.org>
    Cc: Andrew Cooper <andrew.cooper3@citrix.com
    Cc: Linus Torvalds <torvalds@linux-foundation.org>
    Cc: Peter Zijlstra <peterz@infradead.org>
    Link: https://lore.kernel.org/r/20250228-call-nospec-v3-2-96599fed0f33@linux.intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/speculation: Remove the extra #ifdef around CALL_NOSPEC [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Thu Mar 20 11:13:15 2025 -0700

    x86/speculation: Remove the extra #ifdef around CALL_NOSPEC
    
    commit c8c81458863ab686cda4fe1e603fccaae0f12460 upstream.
    
    Commit:
    
      010c4a461c1d ("x86/speculation: Simplify and make CALL_NOSPEC consistent")
    
    added an #ifdef CONFIG_RETPOLINE around the CALL_NOSPEC definition. This is
    not required as this code is already under a larger #ifdef.
    
    Remove the extra #ifdef, no functional change.
    
    vmlinux size remains same before and after this change:
    
     CONFIG_RETPOLINE=y:
          text       data        bss         dec        hex    filename
      25434752    7342290    2301212    35078254    217406e    vmlinux.before
      25434752    7342290    2301212    35078254    217406e    vmlinux.after
    
     # CONFIG_RETPOLINE is not set:
          text       data        bss         dec        hex    filename
      22943094    6214994    1550152    30708240    1d49210    vmlinux.before
      22943094    6214994    1550152    30708240    1d49210    vmlinux.after
    
      [ pawan: s/CONFIG_MITIGATION_RETPOLINE/CONFIG_RETPOLINE/ ]
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Ingo Molnar <mingo@kernel.org>
    Reviewed-by: Josh Poimboeuf <jpoimboe@kernel.org>
    Link: https://lore.kernel.org/r/20250320-call-nospec-extra-ifdef-v1-1-d9b084d24820@linux.intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/speculation: Simplify and make CALL_NOSPEC consistent [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Fri Feb 28 18:35:43 2025 -0800

    x86/speculation: Simplify and make CALL_NOSPEC consistent
    
    commit cfceff8526a426948b53445c02bcb98453c7330d upstream.
    
    CALL_NOSPEC macro is used to generate Spectre-v2 mitigation friendly
    indirect branches. At compile time the macro defaults to indirect branch,
    and at runtime those can be patched to thunk based mitigations.
    
    This approach is opposite of what is done for the rest of the kernel, where
    the compile time default is to replace indirect calls with retpoline thunk
    calls.
    
    Make CALL_NOSPEC consistent with the rest of the kernel, default to
    retpoline thunk at compile time when CONFIG_RETPOLINE is
    enabled.
    
      [ pawan: s/CONFIG_MITIGATION_RETPOLINE/CONFIG_RETPOLINE/ ]
    
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Ingo Molnar <mingo@kernel.org>
    Cc: Andrew Cooper <andrew.cooper3@citrix.com
    Cc: Linus Torvalds <torvalds@linux-foundation.org>
    Cc: Peter Zijlstra <peterz@infradead.org>
    Link: https://lore.kernel.org/r/20250228-call-nospec-v3-1-96599fed0f33@linux.intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

xenbus: Use kref to track req lifetime [+ + +]

Author: Jason Andryuk <jason.andryuk@amd.com>
Date:   Tue May 6 17:09:33 2025 -0400

    xenbus: Use kref to track req lifetime
    
    commit 1f0304dfd9d217c2f8b04a9ef4b3258a66eedd27 upstream.
    
    Marek reported seeing a NULL pointer fault in the xenbus_thread
    callstack:
    BUG: kernel NULL pointer dereference, address: 0000000000000000
    RIP: e030:__wake_up_common+0x4c/0x180
    Call Trace:
     <TASK>
     __wake_up_common_lock+0x82/0xd0
     process_msg+0x18e/0x2f0
     xenbus_thread+0x165/0x1c0
    
    process_msg+0x18e is req->cb(req).  req->cb is set to xs_wake_up(), a
    thin wrapper around wake_up(), or xenbus_dev_queue_reply().  It seems
    like it was xs_wake_up() in this case.
    
    It seems like req may have woken up the xs_wait_for_reply(), which
    kfree()ed the req.  When xenbus_thread resumes, it faults on the zero-ed
    data.
    
    Linux Device Drivers 2nd edition states:
    "Normally, a wake_up call can cause an immediate reschedule to happen,
    meaning that other processes might run before wake_up returns."
    ... which would match the behaviour observed.
    
    Change to keeping two krefs on each request.  One for the caller, and
    one for xenbus_thread.  Each will kref_put() when finished, and the last
    will free it.
    
    This use of kref matches the description in
    Documentation/core-api/kref.rst
    
    Link: https://lore.kernel.org/xen-devel/ZO0WrR5J0xuwDIxW@mail-itl/
    Reported-by: Marek Marczykowski-Górecki <marmarek@invisiblethingslab.com>
    Fixes: fd8aa9095a95 ("xen: optimize xenbus driver for multiple concurrent xenstore accesses")
    Cc: stable@vger.kernel.org
    Signed-off-by: Jason Andryuk <jason.andryuk@amd.com>
    Reviewed-by: Juergen Gross <jgross@suse.com>
    Signed-off-by: Juergen Gross <jgross@suse.com>
    Message-ID: <20250506210935.5607-1-jason.andryuk@amd.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>