			Client Scheduling in X
			    Keith Packard
			       SuSE
			     10/28/99

History:

Since the original X server was written at Digital in 1987, the OS and DIX
layers have shared responsibility for scheduling the order in which client
requests are serviced.  The original design was simplistic; under the maxim
"first make it work, then make it work well", this was a good idea.  Now
that we have a bit more experience with X applications, it's time to
rethink the design.

The basic dispatch loop in DIX looks like:

	for (;;)
	{
		nready = WaitForSomething (...);
		while (nready--)
		{
			isItTimeToYield = FALSE;
			while (!isItTimeToYield)
			{
				if (!ReadRequestFromClient (...))
					break;
				(execute request);
			}
		}
	}

WaitForSomething looks like:

	for (;;)
	{
		if (ANYSET (ClientsWithInput))
			return popcount (ClientsWithInput);
		select (...);
		compute clientsReadable from select result;
		return popcount (clientsReadable);
	}

ReadRequestFromClient looks like:

	if (!fullRequestQueued)
	{
		read ();
		if (!fullRequestQueued)
		{
			remove from ClientsWithInput;
			timesThisConnection = 0;
			return 0;
		}
	}
	if (twoFullRequestsQueued)
		add to ClientsWithInput;

	if (++timesThisConnection >= 10)
	{
		isItTimeToYield = TRUE;
		timesThisConnection = 0;
	}
	return 1;

Here's what happens in this code:

With a single client executing a stream of requests:

	A client sends a packet of requests to the server.

	WaitForSomething wakes up from select and returns that client
	to Dispatch.

	Dispatch calls ReadRequestFromClient, which reads a buffer (4K)
	full of requests from the client.

	The server executes requests from this buffer until it empties,
	in two stages: 10 requests at a time are executed in the
	inner Dispatch loop, and the rest of the buffer is executed
	because WaitForSomething returns immediately if any clients
	have complete requests pending in their input queues.

	When the buffer finally empties, the next call to
	ReadRequestFromClient will return zero and Dispatch will go back
	to WaitForSomething; now that the client has no requests pending,
	WaitForSomething will block in select again.  If the client
	is active, this select will immediately return that client
	as ready to read.

With multiple clients sending streams of requests, the sequence
of operations is similar, except that ReadRequestFromClient will
set isItTimeToYield after every 10 requests executed, causing the
server to round-robin among the clients with available requests.

It's important to realize here that any complete requests which have been
read from clients will be executed before the server uses select again
to discover input from other clients.  A single busy client can easily
monopolize the X server.

As a result, the X server shares poorly with clients which are more
interactive in nature.

The X server executes at most a buffer full of requests before again heading
into select; ReadRequestFromClient causes the server to yield when the
client request buffer doesn't contain a complete request.  When
that buffer is executed quickly, the server spends a lot of time
in select discovering that the same client again has input ready.  Thus
the server also runs busy clients less efficiently than would be
possible.

What to do:

There are several things evident from the above discussion:

 1	The server has a poor metric for deciding how much work it
	should do at one time on behalf of a particular client.

 2	The server doesn't call select often enough to detect less
	aggressive clients in the face of busy clients, especially
	when those clients are executing slow requests.

 3	The server calls select too often when executing fast requests.

 4	Some priority scheme is needed to keep interactive clients
	responding to the user.

And, there are some assumptions about how X applications work:

 1	Each X request is executed relatively quickly; a request-granularity
	is good enough for interactive response almost all of the time.

 2	X applications receiving mouse/keyboard events are likely to
	warrant additional attention from the X server.

Instead of a request-count metric for work, a time-based metric should be
used.  The server should select a reasonable time slice for each client
and execute requests for the entire time slice before yielding to
another client.

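The time-slice test can be sketched as follows.  This is a minimal
illustration, not the server's code: SmartClock, RunSlice and the
per-request cost array are hypothetical names, and the passage of time
is simulated by advancing the counter by each request's cost.

```c
#include <assert.h>

/* Hypothetical tick counter; in a real server a timer would advance it. */
static long SmartClock = 0;

/*
 * Execute queued requests for one client until either the queue is
 * empty or `slice` ticks have elapsed.  cost[i] is the simulated number
 * of ticks request i takes (an assumption made for this sketch).
 * Returns the number of requests executed.
 */
static int
RunSlice (const int *cost, int pending, long slice)
{
	long	start = SmartClock;
	int	executed = 0;

	assert (slice > 0);
	while (executed < pending && SmartClock - start < slice)
	{
		SmartClock += cost[executed];	/* "execute" the request */
		executed++;
	}
	return executed;
}
```

With a slice of one tick, any number of requests costing no ticks run
back to back, while a request that costs a tick ends the slice once it
completes.  A request is never interrupted part way through, which
matches the request-granularity assumption above.
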
Instead of returning immediately from WaitForSomething if clients have
complete requests queued, the server should go through select each
time and gather as many ready clients as possible.  This involves
polling instead of blocking when requests are already queued, and adding
the ClientsWithInput to clientsReadable after the select returns.

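A sketch of that poll-and-merge step, assuming plain POSIX select and
fd_set bookkeeping.  GatherReadyClients and DemoGather are hypothetical
names for illustration; the real WaitForSomething does considerably
more.

```c
#include <assert.h>
#include <stddef.h>
#include <sys/select.h>
#include <unistd.h>

/*
 * Poll (zero timeout) when some client already has a complete request
 * queued, block otherwise; then merge the queued clients into the
 * result.  Returns the number of ready descriptors, or -1 on error.
 */
static int
GatherReadyClients (int maxfd, fd_set *watched, fd_set *withInput,
		    int haveInput, fd_set *readable)
{
	struct timeval	zero = { 0, 0 };
	int		fd, nready = 0;

	assert (maxfd < FD_SETSIZE);
	*readable = *watched;
	if (select (maxfd + 1, readable, NULL, NULL,
		    haveInput ? &zero : NULL) < 0)
		return -1;
	for (fd = 0; fd <= maxfd; fd++)
	{
		if (FD_ISSET (fd, withInput))
			FD_SET (fd, readable);	/* already buffered */
		if (FD_ISSET (fd, readable))
			nready++;
	}
	return nready;
}

/* Pipe a has unread data; pipe b stands for a client whose request
 * was read earlier and is still queued.  Both should be reported. */
static int
DemoGather (void)
{
	int	a[2], b[2], maxfd, n;
	fd_set	watched, withInput, readable;

	if (pipe (a) < 0 || pipe (b) < 0 || write (a[1], "x", 1) != 1)
		return -1;
	FD_ZERO (&watched);
	FD_SET (a[0], &watched);
	FD_SET (b[0], &watched);
	FD_ZERO (&withInput);
	FD_SET (b[0], &withInput);
	maxfd = a[0] > b[0] ? a[0] : b[0];
	n = GatherReadyClients (maxfd, &watched, &withInput, 1, &readable);
	if (n > 0 && !(FD_ISSET (a[0], &readable) && FD_ISSET (b[0], &readable)))
		n = -1;
	close (a[0]); close (a[1]); close (b[0]); close (b[1]);
	return n;
}
```

The zero timeout is what keeps the server from stalling on select while
work is already queued; the blocking call is kept for the idle case.
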
Instead of yielding when the request buffer is empty for a particular
client, leave the yielding to the upper level scheduling and allow
the server to try to read again from the socket.  If the client
is busy, another buffer full of requests will already be waiting
to be delivered, thus avoiding the call through select and the
additional overhead in WaitForSomething.

Finally, the dispatch loop should not simply execute requests from the
first available client; instead, each client should be prioritized, with
busy clients penalized and clients receiving user events praised.

How it's done:

Polling the current time of day from the OS is too expensive to
be done at each request boundary, so instead an interval timer is
set, allowing the server to track time changes by counting invocations
of the related signal handler.  The timer measures process CPU time
rather than wall time.  This serves two purposes: first, it allows the
server to consume no CPU cycles when idle; second, it avoids conflicts
with SIGALRM usage in other parts of the server code.  It's not without
problems though; other CPU intensive processes on the same machine can
degrade interactive response within the X server.  The dispatch loop
can now calculate an approximate time value using the number of signals
received.  The granularity of the timer sets the scheduling jitter;
at 20ms it's only occasionally noticeable.

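One way to build such a tick counter is sketched below.  It assumes
the POSIX ITIMER_VIRTUAL timer, which counts process CPU time and
delivers SIGVTALRM rather than SIGALRM; the text above does not name
the timer, so that choice and the names SmartTicks, StartTickTimer and
ApproxMillis are assumptions for illustration.

```c
#ifndef _XOPEN_SOURCE
#define _XOPEN_SOURCE 700	/* for sigaction and setitimer */
#endif
#include <assert.h>
#include <signal.h>
#include <string.h>
#include <sys/time.h>

#define TICK_USEC 20000		/* 20ms granularity, as in the text */

/* Incremented once per timer signal; read by the dispatch loop. */
static volatile sig_atomic_t SmartTicks = 0;

static void
TickHandler (int sig)
{
	(void) sig;
	SmartTicks++;
}

/* Install the handler and start a repeating CPU-time timer.
 * Returns 0 on success, -1 on error. */
static int
StartTickTimer (void)
{
	struct sigaction	act;
	struct itimerval	timer;

	memset (&act, 0, sizeof act);
	act.sa_handler = TickHandler;
	sigemptyset (&act.sa_mask);
	act.sa_flags = SA_RESTART;
	if (sigaction (SIGVTALRM, &act, NULL) < 0)
		return -1;
	timer.it_interval.tv_sec = 0;
	timer.it_interval.tv_usec = TICK_USEC;
	timer.it_value = timer.it_interval;
	return setitimer (ITIMER_VIRTUAL, &timer, NULL);
}

/* Approximate CPU time consumed since the timer started, in ms. */
static long
ApproxMillis (void)
{
	long ticks = SmartTicks;

	assert (ticks >= 0);
	return ticks * (TICK_USEC / 1000);
}
```

Reading the counter costs one memory load, so it is cheap enough to
check at every request boundary.  Because the timer only advances
while the process is running, a server blocked in select receives no
signals at all.
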
The changes to WaitForSomething and ReadRequestFromClient are
straightforward, adjusting when select is called and avoiding
setting isItTimeToYield too often.

The dispatch loop changes are more extensive: instead of
executing requests from all available clients, a single client
is chosen after each call to WaitForSomething, requests are
executed for that client, and WaitForSomething is called again.

Each client is assigned a priority; the dispatch loop chooses the
client with the highest priority to execute.  Priorities are
updated in three ways:

 1.	Clients which consume their entire slice are penalized
	by having their priority reduced by one until they
	reach some minimum value.

 2.	Clients which have executed no requests for some time
	are praised by having their priority raised until they
	return to normal priority.

 3.	Clients which receive user input are praised by having
	their priority raised until they reach some maximal
	value, above normal priority.

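The three rules and the client choice might be sketched like this.
The bounds, the step of one for praise, the Client structure and all
function names are assumptions made for illustration; the text only
fixes the penalty step at one.

```c
#include <assert.h>

/* Assumed priority bounds; the text only says "minimum", "normal"
 * and "maximal". */
enum { PRIO_MIN = -20, PRIO_NORMAL = 0, PRIO_MAX = 20 };

typedef struct {
	int	priority;
	int	ready;		/* nonzero if requests are available */
} Client;

/* Rule 1: the client used its entire slice. */
static void
PenalizeBusy (Client *c)
{
	if (c->priority > PRIO_MIN)
		c->priority--;
}

/* Rule 2: the client has been idle for some time. */
static void
PraiseIdle (Client *c)
{
	if (c->priority < PRIO_NORMAL)
		c->priority++;
}

/* Rule 3: the client received user input. */
static void
PraiseInput (Client *c)
{
	if (c->priority < PRIO_MAX)
		c->priority++;
}

/* Return the index of the ready client with the highest priority,
 * or -1 if none is ready.  Ties go to the lowest index. */
static int
PickClient (const Client *clients, int n)
{
	int	i, best = -1;

	assert (n >= 0);
	for (i = 0; i < n; i++)
		if (clients[i].ready &&
		    (best < 0 || clients[i].priority > clients[best].priority))
			best = i;
	return best;
}
```

Idle praise stops at normal priority while input praise may exceed it,
so only clients the user is actually interacting with can rise above
the rest.
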
The effect of these changes is to improve both interactive application
response and benchmark numbers at the same time.