History log of /src/sys/kern/kern_pmf.c
Revision  Date  Author  Comments
 1.51  24-Aug-2022  riastradh kern_pmf.c: Fix whitespace, KNF, static.

No functional change intended.
 1.50  24-Aug-2022  riastradh pmf(9): *_child_register never fails. Make it return void.

No kernel bump because this isn't documented or used in any modules,
only in dev/pci/pci.c and dev/cardbus/cardbus.c which are as far as I
know always statically linked into the kernel.

The next change, however, will require a revbump -- to make
pmf_device_register return void so we can prune vast swaths of dead
error branches.
 1.49  24-Aug-2022  riastradh kern: device_pmf_driver_register never fails, so make it return void.

No ABI bump despite change to device.h because this is used only
inside autoconf.
 1.48  28-Mar-2022  riastradh sys: Split struct device into a private device_impl.h.

Include this only inside autoconf itself, and a few files that abuse
autoconf in ways I can't confidently make easy fixes for.

XXX kernel ABI change requires bump -- no more use of struct device
internals allowed, previously done by some drivers
 1.47  01-Jul-2021  blymn Back out fix for kern_pmf.c calling a null if_stop and apply a fix
suggested by Jared McNeill which sets if_stop to a stub function
which means that more than just the pmf is protected from the NULL call.
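
The stub approach in 1.47 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the committed code: struct fake_ifnet, fake_if_stop_stub(), fake_if_attach(), and fake_class_suspend() are hypothetical names, and the real struct ifnet carries far more state.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified stand-in for struct ifnet. */
struct fake_ifnet {
	void (*if_stop)(struct fake_ifnet *, int);
};

/* No-op installed when a driver supplies no if_stop of its own. */
static void
fake_if_stop_stub(struct fake_ifnet *ifp, int disable)
{
	(void)ifp;
	(void)disable;
}

/* Attach-time fixup (assumed placement): afterwards if_stop is never NULL. */
static void
fake_if_attach(struct fake_ifnet *ifp)
{
	if (ifp->if_stop == NULL)
		ifp->if_stop = fake_if_stop_stub;
}

/* Any caller, not only the suspend path, can now call unconditionally. */
static void
fake_class_suspend(struct fake_ifnet *ifp)
{
	assert(ifp->if_stop != NULL);
	(*ifp->if_stop)(ifp, 0);
}
```

Fixing the pointer once at attach time, rather than testing for NULL at each call site, is what extends the protection beyond pmf.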
 1.46  30-Jun-2021  blymn Prevent kernel panic on sleep if network interface driver does not
have if_stop defined. This is a common problem with usb adaptors.
This is a temporary fix, the printf needs to be converted to an
assert once the drivers are cleaned up.
 1.45  11-Jun-2020  thorpej branches: 1.45.6;
Oops, missed a couple.
 1.44  11-Jun-2020  thorpej Update for proplib(3) API changes.
 1.43  12-May-2020  jdolecek need to take IFNET_LOCK() around if_stop (on suspend) and if_init (on resume)
calls, those need to read and/or manipulate if_flags and hence need
the lock for IFEF_MPSAFE drivers; the drivers can't do IFNET_LOCK() themselves,
because the ioctl path call these hooks with the lock held

fixes KASSERT() in xennet(4) while investigating PR port-xen/55207
 1.42  20-Apr-2020  ad Rename buf_syncwait() to vfs_syncwait(), and have it wait on v_numoutput
rather than BC_BUSY. Removes the dependency on bufhash.
 1.41  23-Feb-2020  ad branches: 1.41.4;
shutdown_all: take kernel_lock now that kern_reboot() doesn't.
 1.40  08-Apr-2018  mlelstv branches: 1.40.2; 1.40.8;
expose pmf debug switches with sysctl.
 1.39  07-Jul-2016  msaitoh branches: 1.39.16;
KNF. Remove extra spaces. No functional change.
 1.38  20-Jun-2016  knakahara apply if_start_lock() to L2 callers which call ifp->if_start() of device drivers
 1.37  13-Feb-2015  maxv Remove this MALLOC_DEFINE (M_PMF unused).
 1.36  13-Nov-2012  chs branches: 1.36.12; 1.36.14;
in pmf_system_shutdown(), don't do anything if we're panicking.
this prevents a hang trying to shut down other CPUs on x86,
and in general we could be called in any context from a panic
so it's best to skip unnecessary operations in that case.
 1.35  05-Jun-2011  dsl branches: 1.35.2; 1.35.12;
Don't directly call sys_sync() from random bits of code, instead
add do_sys_sync() that takes an 'lwp' (for l_cred) as an argument.
Explicitly pass &lwp0 rather than NULL and expecting sys_sync to
substitute some random lwp.
 1.34  27-Apr-2011  rmind branches: 1.34.2;
G/C M_EMULDATA
 1.33  24-Feb-2010  dyoung branches: 1.33.2; 1.33.4;
A pointer typedef entails trading too much flexibility to declare const
and non-const types, and the kernel uses both const and non-const
PMF qualifiers and device suspensors, so change the pmf_qual_t and
device_suspensor_t typedefs from "pointers to const" to non-pointer,
non-const types.
 1.32  17-Feb-2010  dyoung Initialize the temporary pmf_qual_t in pmf_device_subtree_release()
to avoid a failed ds != NULL assertion, later.
 1.31  08-Jan-2010  dyoung branches: 1.31.2;
Expand PMF_FN_* macros.
 1.30  27-Oct-2009  rmind - Use pool(9) for pmf_event_workitem_t, instead of pool_cache(9). Still,
meta-data of this pool takes more space than the actual data..

- Reduce lowat/hiwat to 1..8, since intensity is very low.

- Remove unused pew_next_free from pmf_event_workitem_t.
 1.29  16-Sep-2009  dyoung In pmf(9), improve the implementation of device self-suspension
and make suspension by self, by drvctl(8), and by ACPI system sleep
play nice together. Start solidifying some temporary API changes.

1. Extract a new header file, <sys/device_if.h>, from <sys/device.h> and
#include it from <sys/pmf.h> instead of <sys/device.h> to break the
circular dependency between <sys/device.h> and <sys/pmf.h>.

2. Introduce pmf_qual_t, an aggregate of qualifications on a PMF
suspend/resume call. Start to replace instances of PMF_FN_PROTO,
PMF_FN_ARGS, et cetera, with a pmf_qual_t.

3. Introduce the notion of a "suspensor," an entity that holds a
device in suspension. More than one suspensor may hold a device
at once. A device stays suspended as long as at least one
suspensor holds it. A device resumes when the last suspensor
releases it.

Currently, the kernel defines three suspensors,

3a the system-suspensor: for system suspension, initiated
by 'sysctl -w machdep.sleep_state=3', by lid closure, by
power-button press, et cetera,

3b the drvctl-suspensor: for device suspension by /dev/drvctl
ioctl, e.g., drvctl -S sip0.

3c the system self-suspensor: for device drivers that suspend
themselves and their children. Several drivers for network
interfaces put the network device to sleep while it is not
administratively up, that is, after the kernel calls if_stop(,
1). The self-suspensor should not be used directly. See
the description of suspensor delegates, below.

A suspensor can have one or more "delegates". A suspensor can
release devices that its delegates hold suspended. Right now,
only the system self-suspensor has delegates. For each device
that a self-suspending driver attaches, it creates the device's
self-suspensor, a delegate of the system self-suspensor.

Suspensors stop a system-wide suspend/resume cycle from waking
devices that the operator put to sleep with drvctl before the cycle.
They also help self-suspension to work more simply, safely, and in
accord with expectations.

4. Add the notion of device activation level, devact_level_t,
and a routine for checking the current activation level,
device_activation(). Current activation levels are DEVACT_LEVEL_BUS,
DEVACT_LEVEL_DRIVER, and DEVACT_LEVEL_CLASS, which respectively
indicate that the device's bus is active, that the bus and device are
active, and that the bus, device, and the functions of the device's
class (network, audio) are active.

Suspend/resume calls can be qualified with a devact_level_t.
The power-management framework treats a devact_level_t that
qualifies a device suspension as the device's current activation
level; it only runs hooks to reduce the activation level from
the presumed current level to the fully suspended state. The
framework treats a devact_level_t qualifying device resumption
as the target activation level; it only runs hooks to raise the
activation level to the target.

5. Use pmf_qual_t, devact_level_t, and self-suspensors in several
drivers.

6. Temporarily add an unused power-management workqueue that I will
remove or replace, soon.
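
The hold/release rule for suspensors in item 3 can be sketched as below. This is an assumption-laden illustration only: the fake_* names and the bitmask representation are hypothetical, and delegates are left out entirely.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical suspensor identifiers, one bit each. */
enum fake_suspensor {
	FAKE_SUSPENSOR_SYSTEM = 1 << 0,	/* system sleep */
	FAKE_SUSPENSOR_DRVCTL = 1 << 1,	/* drvctl -S */
	FAKE_SUSPENSOR_SELF   = 1 << 2	/* driver self-suspension */
};

struct fake_device {
	unsigned holders;	/* set of suspensors holding the device */
};

/* A device stays suspended while at least one suspensor holds it. */
static bool
fake_is_suspended(const struct fake_device *dev)
{
	return dev->holders != 0;
}

static void
fake_suspend(struct fake_device *dev, enum fake_suspensor who)
{
	dev->holders |= (unsigned)who;
}

/* Returns true if no suspensor holds the device afterwards. */
static bool
fake_release(struct fake_device *dev, enum fake_suspensor who)
{
	dev->holders &= ~(unsigned)who;
	return dev->holders == 0;
}
```

With this model, a device that the operator suspended with drvctl before a system sleep remains suspended after the system resumes, because releasing the system hold leaves the drvctl hold in place.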
 1.28  08-Jul-2009  dyoung pmf_event_inject(9) may be called from interrupt context, so we
must not allocate a pmf_event_workitem_t using kmem_alloc(9). Use
pool_cache(9), instead, because it is safe in interrupt context.
Thanks, rmind@, for catching the problem and suggesting the solution.
 1.27  26-Jun-2009  dyoung Switch to kmem(9).

(void *)pew is one way to get a struct work *, but let's
write &pew->pew_work, instead. It is more defensive and persuasive.

Make miscellaneous changes in support of tearing down arbitrary
stacks of filesystems and devices during shutdown:

1 Move struct shutdown_state, shutdown_first(), and shutdown_next(),
from kern_pmf.c to subr_autoconf.c. Rename detach_all() to
config_detach_all(), and move it from kern_pmf.c to subr_autoconf.c.
Export all of those routines.

2 In pmf_system_shutdown(), do not suspend user process scheduling, and
do not detach all devices: I am going to do that in cpu_reboot(),
instead. (Soon I will do it in an MI cpu_reboot() routine.) Do still
call PMF shutdown hooks.

3 In config_detach(), add a DIAGNOSTIC assertion: if we're exiting
config_detach() at the bottom, alldevs_nwrite had better not be 0,
because config_detach() is a writer of the device list.

4 In deviter_release(), check to see if we're iterating the device list
for reading, *first*, and if so, decrease the number of readers. Used
to be that if we happened to be reading during shutdown, we ran the
shutdown branch. Thus the number of writers reached 0, the number
of readers remained > 0, and no writer could iterate again. Under
certain circumstances that would cause a hang during shutdown.
 1.26  17-Apr-2009  dyoung Do not interleave device detachment with device shutdown. Instead, try
over and over to detach all of the devices. Stop when we cannot detach
even a single device in a cycle. Call shutdown hooks on all of the
devices that remain attached.

This is another step toward the detach/unmount cycle that will help us
tear down arbitrary stacks of filesystems, ccd(4), raid(4), and vnd(4).
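
The retry loop described in 1.26 can be sketched as follows. This is a simplified model, not the kernel code: struct fake_dev, fake_try_detach(), and fake_detach_all() are hypothetical, and the "busy" flag stands in for any reason a driver might refuse to detach.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical device record for illustration. */
struct fake_dev {
	int parent;	/* index of parent, or -1 for the root */
	bool attached;
	bool busy;	/* refuses to detach */
};

/* A device detaches only if it is not busy and has no attached child. */
static bool
fake_try_detach(struct fake_dev *devs, size_t n, size_t i)
{
	if (devs[i].busy)
		return false;
	for (size_t j = 0; j < n; j++)
		if (devs[j].attached && devs[j].parent == (int)i)
			return false;
	devs[i].attached = false;
	return true;
}

/*
 * Repeat passes over all devices until one full pass detaches nothing.
 * Returns the number of devices still attached; in the scheme above,
 * those are the ones that would get their shutdown hooks called.
 */
static size_t
fake_detach_all(struct fake_dev *devs, size_t n)
{
	bool progress;
	size_t remaining = 0;

	do {
		progress = false;
		for (size_t i = 0; i < n; i++)
			if (devs[i].attached && fake_try_detach(devs, n, i))
				progress = true;
	} while (progress);

	for (size_t i = 0; i < n; i++)
		if (devs[i].attached)
			remaining++;
	return remaining;
}
```

Each pass can only peel off devices whose children are already gone, so a chain of depth d needs about d passes; a busy leaf keeps its whole ancestor chain attached.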
 1.25  16-Apr-2009  skrll 0 -> NULL
 1.24  07-Apr-2009  dyoung Cosmetic: join lines.
 1.23  02-Apr-2009  dyoung Take out a noisy debug statement that slipped in with device-detachment
at shutdown.
 1.22  02-Apr-2009  dyoung During shutdown, detach devices in an orderly fashion.

Call the detach routine for every device in the device tree, starting
with the leaves and moving toward the root, expecting that each
(pseudo-)device driver will use the opportunity to gracefully commit
outstanding transactions to the underlying (pseudo-)device and to
relinquish control of the hardware to the system BIOS.

Detaching devices is not suitable for every shutdown: in an emergency,
or if the system state is inconsistent, we should resort to a fast,
simple shutdown that uses only the pmf(9) shutdown hooks and the
(deprecated) shutdownhooks. For now, if the flag RB_NOSYNC is set in
boothowto, opt for the fast, simple shutdown.

Add a device flag, DVF_DETACH_SHUTDOWN, that indicates by its presence
that it is safe to detach a device during shutdown. Introduce macros
CFATTACH_DECL3() and CFATTACH_DECL3_NEW() for creating autoconf
attachments with default device flags. Add DVF_DETACH_SHUTDOWN
to configuration attachments for atabus(4), atw(4) at cardbus(4),
cardbus(4), cardslot(4), com(4) at isa(4), elanpar(4), elanpex(4),
elansc(4), gpio(4), npx(4) at isa(4), nsphyter(4), pci(4), pcib(4),
pcmcia(4), ppb(4), sip(4), wd(4), and wdc(4) at isa(4).

Add a device-detachment "reason" flag, DETACH_SHUTDOWN, that tells the
autoconf code and a device driver that the reason for detachment is
system shutdown.

Add a sysctl, kern.detachall, that tells the system to try to detach
every device at shutdown, regardless of any device's DVF_DETACH_SHUTDOWN
flag. The default for kern.detachall is 0. SET IT TO 1, PLEASE, TO
HELP TEST AND DEBUG DEVICE DETACHMENT AT SHUTDOWN.

This is a work in progress. In future work, I aim to treat
pseudo-devices more thoroughly, and to gracefully tear down a stack of
(pseudo-)disk drivers and filesystems, including cgd(4), vnd(4), and
raid(4) instances at shutdown.

Also commit some changes that are not easily untangled from the rest:

(1) begin to simplify device_t locking: rename struct pmf_private to
device_lock, and incorporate device_lock into struct device.

(2) #include <sys/device.h> in sys/pmf.h in order to get some
definitions that it needs. Stop unnecessarily #including <sys/device.h>
in sys/arch/x86/include/pic.h to keep the amd64, xen, and i386 releases
building.
 1.21  06-Feb-2009  dyoung branches: 1.21.2;
Extract a subroutine pmf_device_resume_descendants() from
pmf_device_resume_subtree(), for re-use later.
 1.20  17-Jun-2008  tsutsui branches: 1.20.4;
Include <sys/sched.h> before <sys/syscallargs.h> for cpuset_t.
 1.19  05-May-2008  jmcneill branches: 1.19.2; 1.19.4;
Use 2-clause license.
 1.18  31-Mar-2008  xtraeme branches: 1.18.2; 1.18.4;
Remove useless returns at the end of void functions.
 1.17  12-Mar-2008  dyoung Use device_t and its accessors throughout. Use aprint_*_dev().

Improve PMF-ability.

Add a 'flags' argument to suspend/resume handlers and
callers such as pmf_system_suspend().

Define a flag, PMF_F_SELF, which indicates to PMF that a
device is suspending/resuming itself. Add helper routines,
pmf_device_suspend_self(dev) and pmf_device_resume_self(dev),
that call pmf_device_suspend(dev, PMF_F_SELF) and
pmf_device_resume(dev, PMF_F_SELF), respectively. Use
PMF_F_SELF to suspend/resume self in ath(4), audio(4),
rtw(4), and sip(4).

In ath(4) and in rtw(4), replace the icky sc_enable/sc_disable
callbacks, provided by the bus front-end, with
self-suspension/resumption. Also, clean up the bus
front-ends. Make sure that the interrupt handler is
disestablished during suspension. Get rid of driver-private
flags (e.g., RTW_F_ENABLED, ath_softc->sc_invalid); use
device_is_active()/device_has_power() calls, instead.

In the network-class suspend handler, call if_stop(, 0)
instead of if_stop(, 1), because the latter is superfluous
(bus- and driver-suspension hooks will 'disable' the NIC),
and it may cause recursion.

In the network-class resume handler, prevent infinite
recursion through if_init() by getting out early if we are
self-suspending (PMF_F_SELF).

rtw(4) improvements:

Destroy rtw(4) callouts when we detach it. Make rtw at
pci detachable. Print some more information with the "rx
frame too long" warning.

Remove activate() methods:

Get rid of rtw_activate() and ath_activate(). The device
activate() methods are not good for much these days.

Make ath at cardbus resume with crypto functions intact:

Introduce a boolean device property, "pmf-powerdown". If
pmf-powerdown is present and false, it indicates that a
bus back-end should not remove power from a device.

Honor this property in cardbus_child_suspend().

Set this property to 'false' in ath_attach(), since removing
power from an ath at cardbus seems to lobotomize the WPA
crypto engine. XXX Should the pmf-powerdown property
propagate toward the root of the device tree?

Miscellaneous ath(4) changes:

Warn if ath(4) tries to write crypto keys to suspended
hardware.

Reduce differences between FreeBSD and NetBSD in ath(4)
multicast filter setup.

Make ath_printrxbuf() print an rx descriptor's status &
key index, to help debug crypto errors.

Shorten a staircase in ath_ioctl(). Don't check for
ieee80211_ioctl() return code ERESTART, it never happens.
 1.16  07-Mar-2008  dyoung PMF: synchronize device suspension and resumption.
 1.15  05-Mar-2008  dyoung Synchronize readers and writers of the device tree.

Add a device iterator object, deviter_t, and methods deviter_init(),
deviter_first(), and deviter_next() for visiting each device in
the device tree.

Take care not to re-shutdown a device in the event that the machine
panics during reboot and the operator types 'reboot' at the kernel
debugger prompt.

While I'm here, sprinkle PMF_FN_ARGS, PMF_FN_PROTO, et cetera.
 1.14  05-Mar-2008  dyoung Introduce PMF_FN_{ARGS,PROTO}1, and use PMF_FN_{ARGS,PROTO} more
widely, further helping me to introduce PMF API changes piecemeal.
 1.13  28-Feb-2008  drochner Extend the pmf suspend/resume hooks by a shutdown method, so drivers
can register a shutdown handler explicitly.
Install a pci bus shutdown handler which disables bus master accesses
for all children, so the drivers don't need to care.
This will hopefully be sufficient to replace the shutdownhooks
(together with the powerhooks). (It has been suggested to use some
general event notification framework for shutdown handlers, but there
might be cases where shutdown handlers must be run in an order following
the device hierarchy, which wouldn't be easy with event handlers
not tied to drivers.)
approved by David Young
 1.12  20-Feb-2008  drochner branches: 1.12.2; 1.12.6;
On system suspend, first get rid of the X server because it doesn't
respect the kernel device tree. (It is arguably ugly to special-case
wscons here, but as long as there is only one driver to be dealt with
it is not worth introducing another set of hooks.)
Resume the X server at the end of resume, if everything went well.
Acquire the big KERNEL_LOCK before the device tree is walked on
suspend, until after the walk on resume. This is needed to avoid
device accesses by secondary CPUs, and it effectively keeps user
programs from interfering with the suspend process. This might be
revisited when all drivers are using private locks for MP-safeness
(but FreeBSD still does the same afaics).
It should be unnecessary now to switch secondary CPUs offline in
the powerd suspend script.
 1.11  30-Jan-2008  jmcneill pmf_event_deregister was passing M_WAITOK to free instead of M_DEVBUF,
pointed out by simonb
 1.10  27-Dec-2007  jmcneill branches: 1.10.2; 1.10.4;
Don't call pmf_check_system_drivers on shutdown.
 1.9  23-Dec-2007  rmind branches: 1.9.2; 1.9.4;
OK, I was using whisky..
Fix previous - use foreach, and just return after first found entry.
The pmf_all_events list should not have duplicate entries (perhaps
pmf(9) should document this point).
 1.8  23-Dec-2007  rmind pmf_event_deregister: fix use-after-free.
Found by: CID 4754.
 1.7  21-Dec-2007  jmcneill Fix handling of directed pmf events.
 1.6  14-Dec-2007  jmcneill Introduce pmf_system_bus_resume. In the ACPI S3 resume path, use this to
separate powering up devices from restoring their state. This is required
on some machines where AcpiLeaveSleepState can fail due to an attempt to
access a powered off device.
 1.5  11-Dec-2007  jmcneill Use PRI_NONE for the event workqueue.
 1.4  11-Dec-2007  jmcneill Use malloc/free instead of kmem_alloc/kmem_free in pmf_event_inject, since
the latter is not safe to use from a softint. Pointed out by rmind.
 1.3  10-Dec-2007  xtraeme Remove useless returns at the end of void functions.
 1.2  09-Dec-2007  jmcneill branches: 1.2.2;
Merge jmcneill-pm branch.
 1.1  08-Dec-2007  jmcneill branches: 1.1.2; 1.1.4;
file kern_pmf.c was initially added on branch jmcneill-pm.
 1.1.4.1  11-Dec-2007  yamt sync with head.
 1.1.2.1  08-Dec-2007  jmcneill Rename pnp(9) -> pmf(9), as requested by many.
 1.2.2.2  02-Jan-2008  bouyer Sync with HEAD
 1.2.2.1  13-Dec-2007  bouyer Sync with HEAD
 1.9.4.2  26-Dec-2007  ad Sync with head.
 1.9.4.1  23-Dec-2007  ad file kern_pmf.c was added on branch vmlocking2 on 2007-12-26 19:17:22 +0000
 1.9.2.2  18-Feb-2008  mjf Sync with HEAD.
 1.9.2.1  23-Dec-2007  mjf file kern_pmf.c was added on branch mjf-devfs on 2008-02-18 21:06:46 +0000
 1.10.4.5  17-Mar-2008  yamt sync with head.
 1.10.4.4  27-Feb-2008  yamt sync with head.
 1.10.4.3  04-Feb-2008  yamt sync with head.
 1.10.4.2  21-Jan-2008  yamt sync with head
 1.10.4.1  27-Dec-2007  yamt file kern_pmf.c was added on branch yamt-lazymbuf on 2008-01-21 09:46:08 +0000
 1.10.2.3  23-Mar-2008  matt sync with HEAD
 1.10.2.2  09-Jan-2008  matt sync with HEAD
 1.10.2.1  27-Dec-2007  matt file kern_pmf.c was added on branch matt-armv6 on 2008-01-09 01:56:06 +0000
 1.12.6.3  29-Jun-2008  mjf Sync with HEAD.
 1.12.6.2  02-Jun-2008  mjf Sync with HEAD.
 1.12.6.1  03-Apr-2008  mjf Sync with HEAD.
 1.12.2.1  24-Mar-2008  keiichi sync with head.
 1.18.4.4  11-Mar-2010  yamt sync with head
 1.18.4.3  18-Jul-2009  yamt sync with head.
 1.18.4.2  04-May-2009  yamt sync with head.
 1.18.4.1  16-May-2008  yamt sync with head.
 1.18.2.1  18-May-2008  yamt sync with head.
 1.19.4.1  18-Jun-2008  simonb Sync with head.
 1.19.2.1  23-Jun-2008  wrstuden Sync w/ -current. 34 merge conflicts to follow.
 1.20.4.2  28-Apr-2009  skrll Sync with HEAD.
 1.20.4.1  03-Mar-2009  skrll Sync with HEAD.
 1.21.2.2  23-Jul-2009  jym Sync with HEAD.
 1.21.2.1  13-May-2009  jym Sync with HEAD.

Commit is split, to avoid a "too many arguments" protocol error.
 1.31.2.1  30-Apr-2010  uebayasi Sync with HEAD.
 1.33.4.1  06-Jun-2011  jruoho Sync with HEAD.
 1.33.2.2  12-Jun-2011  rmind sync with head
 1.33.2.1  31-May-2011  rmind sync with head
 1.34.2.1  23-Jun-2011  cherry Catchup with rmind-uvmplock merge.
 1.35.12.2  03-Dec-2017  jdolecek update from HEAD
 1.35.12.1  20-Nov-2012  tls Resync to 2012-11-19 00:00:00 UTC
 1.35.2.1  16-Jan-2013  yamt sync with (a bit old) head
 1.36.14.2  09-Jul-2016  skrll Sync with HEAD
 1.36.14.1  06-Apr-2015  skrll Sync with HEAD
 1.36.12.1  25-Mar-2015  snj Pull up following revision(s) (requested by maxv in ticket #617):
sys/kern/kern_malloc.c: revision 1.144, 1.145
sys/kern/kern_pmf.c: revision 1.37
sys/rump/librump/rumpkern/rump.c: revision 1.316
sys/uvm/uvm_extern.h: revision 1.193
sys/uvm/uvm_km.c: revision 1.139
Don't include <uvm/uvm_extern.h>
--
Kill kmeminit().
--
Remove this MALLOC_DEFINE (M_PMF unused).
 1.39.16.1  16-Apr-2018  pgoyette Sync with HEAD, resolve some conflicts
 1.40.8.1  29-Feb-2020  ad Sync with head.
 1.40.2.2  21-Apr-2020  martin Sync with HEAD
 1.40.2.1  08-Apr-2020  martin Merge changes from current as of 20200406
 1.41.4.1  25-Apr-2020  bouyer Sync with bouyer-xenpvh-base2 (HEAD)
 1.45.6.1  01-Aug-2021  thorpej Sync with HEAD.
