			Client Scheduling in X
			    Keith Packard
			       SuSE
			     10/28/99

History:

Since the original X server was written at Digital in 1987, the OS and DIX
layers have shared responsibility for scheduling the order in which client
requests are serviced.  The original design was simplistic; under the maxim
"first make it work, then make it work well", this was a good idea.  Now
that we have a bit more experience with X applications, it's time to
rethink the design.

The basic dispatch loop in DIX looks like:

	for (;;)
	{
		nready = WaitForSomething (...);
		while (nready--)
		{
			isItTimeToYield = FALSE;
			while (!isItTimeToYield)
			{
				if (!ReadRequestFromClient (...))
					break;
				(execute request);
			}
		}
	}

WaitForSomething looks like:

	for (;;)
	{
		if (ANYSET (ClientsWithInput))
			return popcount (ClientsWithInput);
		select (...);
		compute clientsReadable from select result;
		return popcount (clientsReadable);
	}

ReadRequestFromClient looks like:

	if (!fullRequestQueued)
	{
		read ();
		if (!fullRequestQueued)
		{
			remove from ClientsWithInput;
			timesThisConnection = 0;
			return 0;
		}
	}
	if (twoFullRequestsQueued)
		add to ClientsWithInput;

	if (++timesThisConnection >= 10)
	{
		isItTimeToYield = TRUE;
		timesThisConnection = 0;
	}
	return 1;

Here's what happens in this code:

With a single client executing a stream of requests:

	A client sends a packet of requests to the server.

	WaitForSomething wakes up from select and returns that client
	to Dispatch.

	Dispatch calls ReadRequestFromClient, which reads a buffer (4K)
	full of requests from the client.

	The server executes requests from this buffer until it empties,
	in two stages -- 10 requests at a time are executed in the
	inner Dispatch loop, and the whole buffer full of requests is
	executed because WaitForSomething returns immediately if any
	clients have complete requests pending in their input queues.

	When the buffer finally empties, the next call to
	ReadRequestFromClient will return zero and Dispatch will go back
	to WaitForSomething; now that the client has no requests pending,
	WaitForSomething will block in select again.  If the client
	is active, this select will immediately return that client
	as ready to read.

With multiple clients sending streams of requests, the sequence
of operations is similar, except that ReadRequestFromClient will
set isItTimeToYield after every 10 requests executed, causing the
server to round-robin among the clients with available requests.

It's important to realize here that any complete requests which have been
read from clients will be executed before the server will use select again
to discover input from other clients.  A single busy client can easily
monopolize the X server.

So, the X server doesn't share well with clients which are more interactive
in nature.

The X server executes at most a buffer full of requests before again heading
into select; ReadRequestFromClient causes the server to yield when the
client request buffer doesn't contain a complete request.  When
that buffer is executed quickly, the server spends a lot of time
in select discovering that the same client again has input ready.  Thus
the server also runs busy clients less efficiently than would be
possible.

What to do:

There are several things evident from the above discussion:

 1	The server has a poor metric for deciding how much work it
	should do at one time on behalf of a particular client.

 2	The server doesn't call select often enough to detect less
	aggressive clients in the face of busy clients, especially
	when those clients are executing slow requests.

 3	The server calls select too often when executing fast requests.

 4	Some priority scheme is needed to keep interactive clients
	responding to the user.

And, there are some assumptions about how X applications work:

 1	Each X request is executed relatively quickly; a request-granularity
	is good enough for interactive response almost all of the time.

 2	X applications receiving mouse/keyboard events are likely to
	warrant additional attention from the X server.

Instead of a request-count metric for work, a time-based metric should be
used.  The server should select a reasonable time slice for each client
and execute requests for the entire time slice before yielding to
another client.
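
The time-based metric can be sketched as follows.  This is a minimal
illustration, not the server's code: the names sched_ticks, SLICE_TICKS,
slice_client, execute_request and run_slice are all hypothetical, and the
cost of a request is simulated by advancing the clock by hand so that the
example is self-contained.

```c
#include <stdbool.h>

/* All names here are hypothetical; none come from the X server source. */
#define SLICE_TICKS 2                       /* assumed slice length, in ticks */

static volatile unsigned long sched_ticks;  /* coarse clock, normally advanced by a timer */

struct slice_client {
    int pending;                /* complete requests waiting to run */
    int executed;               /* requests executed so far */
    unsigned long cost_ticks;   /* simulated cost of one request, in ticks */
};

/*
 * Stand-in for executing one request.  To keep the sketch self-contained
 * it advances the clock itself; in a server, time would pass on its own.
 */
static void execute_request(struct slice_client *c)
{
    c->pending--;
    c->executed++;
    sched_ticks += c->cost_ticks;
}

/*
 * Run one client until its slice expires or it has no requests left.
 * Returns true if the client consumed the whole slice.
 */
static bool run_slice(struct slice_client *c)
{
    unsigned long start = sched_ticks;

    while (c->pending > 0) {
        execute_request(c);
        if (sched_ticks - start >= SLICE_TICKS)
            return true;        /* slice used up: yield to another client */
    }
    return false;               /* ran out of requests before the slice ended */
}
```

With this shape, a client issuing slow requests is cut off after a few of
them, while a client issuing fast requests may execute many more in the
same slice, which a fixed count of 10 requests cannot express.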

Instead of returning immediately from WaitForSomething if clients have
complete requests queued, the server should go through select each
time and gather as many ready clients as possible.  This involves
polling instead of blocking and adding the ClientsWithInput to
clientsReadable after the select returns.
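
A rough sketch of that revised wait step, using the POSIX select call.
The function name wait_for_something and its parameters are assumptions
made for this example; only the idea (poll with a zero timeout when
requests are already queued, then merge the queued clients into the
result) comes from the text above.

```c
#include <stdbool.h>
#include <stddef.h>
#include <sys/select.h>

/*
 * all_clients: every client descriptor being watched.
 * with_input:  clients that already have a complete request buffered
 *              (the ClientsWithInput set of the text).
 * ready:       on return, the union of with_input and the descriptors
 *              that select reports as readable.
 * Returns the number of ready descriptors, or -1 if select fails.
 */
static int wait_for_something(fd_set *all_clients, fd_set *with_input,
                              int maxfd, fd_set *ready)
{
    struct timeval zero = { 0, 0 };
    bool have_queued = false;
    int fd, n = 0;

    for (fd = 0; fd <= maxfd; fd++)
        if (FD_ISSET(fd, with_input))
            have_queued = true;

    *ready = *all_clients;
    /* Poll (zero timeout) when work is already queued; otherwise block. */
    if (select(maxfd + 1, ready, NULL, NULL, have_queued ? &zero : NULL) < 0)
        return -1;

    for (fd = 0; fd <= maxfd; fd++) {
        if (FD_ISSET(fd, with_input))
            FD_SET(fd, ready);      /* add queued clients after select returns */
        if (FD_ISSET(fd, ready))
            n++;
    }
    return n;
}
```

The point of the zero timeout is that a quiet client whose data has just
arrived is noticed on every pass, even while a busy client still has
requests buffered.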

Instead of yielding when the request buffer is empty for a particular
client, leave the yielding to the upper level scheduling and allow
the server to try to read again from the socket.  If the client
is busy, another buffer full of requests will already be waiting
to be delivered, thus avoiding the call through select and the
additional overhead in WaitForSomething.

Finally, the dispatch loop should not simply execute requests from the
first available client; instead, each client should be prioritized, with
busy clients penalized and clients receiving user events praised.

How it's done:

Polling the current time of day from the OS is too expensive to
be done at each request boundary, so instead an interval timer is
set, allowing the server to track time changes by counting invocations
of the related signal handler.  The timer measures process CPU time
rather than wall time.  This serves two purposes -- first, it allows
the server to consume no CPU cycles when idle; second, it avoids
conflicts with SIGALRM usage in other parts of the server code.  It's
not without problems though; other CPU intensive processes on the same
machine can degrade interactive response within the X server.  The
dispatch loop can now calculate an approximate time value using the
number of signals received.  The granularity of the timer sets the
scheduling jitter; at 20ms it's only occasionally noticeable.

The changes to WaitForSomething and ReadRequestFromClient are
straightforward, adjusting when select is called and avoiding
setting isItTimeToYield too often.

The dispatch loop changes are more extensive.  Now, instead of
executing requests from all available clients, a single client
is chosen after each call to WaitForSomething, requests are
executed for that client, and WaitForSomething is called again.

Each client is assigned a priority; the dispatch loop chooses the
client with the highest priority to execute.  Priorities are
updated in three ways:

 1.	Clients which consume their entire slice are penalized
	by having their priority reduced by one until they
	reach some minimum value.

 2.	Clients which have executed no requests for some time
	are praised by having their priority raised until they
	return to normal priority.

 3.	Clients which receive user input are praised by having
	their priority raised until they reach some maximal
	value, above normal priority.
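
The three rules, together with the choice of the highest-priority client,
might look like the sketch below.  The bounds PRIO_MIN, PRIO_NORMAL and
PRIO_MAX are placeholders (the text gives no concrete values), the step
of one for rules 2 and 3 is assumed, and the type and function names are
invented for the example.

```c
#include <stddef.h>

/* Hypothetical bounds; the text names no concrete values. */
#define PRIO_MIN    (-20)
#define PRIO_NORMAL 0
#define PRIO_MAX    20

struct sched_client {
    int priority;
};

/* Rule 1: the client consumed its entire slice. */
static void penalize_busy(struct sched_client *c)
{
    if (c->priority > PRIO_MIN)
        c->priority--;
}

/* Rule 2: the client has been idle for some time; drift back up to normal. */
static void praise_idle(struct sched_client *c)
{
    if (c->priority < PRIO_NORMAL)
        c->priority++;
}

/* Rule 3: the client received user input; it may rise above normal. */
static void praise_input(struct sched_client *c)
{
    if (c->priority < PRIO_MAX)
        c->priority++;
}

/* Pick the ready client with the highest priority; NULL if none is ready. */
static struct sched_client *pick_client(struct sched_client **ready, size_t n)
{
    struct sched_client *best = NULL;
    size_t i;

    for (i = 0; i < n; i++)
        if (best == NULL || ready[i]->priority > best->priority)
            best = ready[i];
    return best;
}
```

Ties go to the earlier entry in the ready list here; a real scheduler
would probably rotate among equal-priority clients to stay fair.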

The effect of these changes is to improve both interactive application
response and benchmark numbers at the same time.

$XFree86: $