Nbsp; How the Compiler Transforms an Async Function into a State Machine
When working with async functions, you will be more productive with them if you have an under- standing and appreciation for the code transform that the compiler is doing for you. And, I think the easiest and best way for you to learn that is by going through an example. So, let’s start off by defin- ing some simple type definitions and some simple methods.
internal sealed class Type1 { } internal sealed class Type2 { }
private static async Task<Type1> Method1Async() {
/* Does some async thing that returns a Type1 object */
}
private static async Task<Type2> Method2Async() {
/* Does some async thing that returns a Type2 object */
}
4 Instead of blocking a thread by having it wait on a thread synchronization construct, you could await the task re- turned from calling SemaphoreSlim’s WaitAsync method or my own OneManyLock’s AcquireAsync method. I discuss both of these in Chapter 30, “Hybrid Thread Synchronization Constructs.”
Now, let me show you an async function that consumes these simple types and methods.
Type1 result1 = await Method1Async(); for (Int32 x = 0; x < 3; x++) {
Type2 result2 = await Method2Async();
}
}
catch (Exception) { Console.WriteLine("Catch");
}
finally {
Console.WriteLine("Finally");
}
return "Done";
}
Although MyMethodAsync seems rather contrived, it demonstrates some key things. First, it is an async function itself that returns a Task<String> but the code’s body ultimately returns a String. Second, it calls other functions that execute operations asynchronously, one stand-alone and the other from within a for loop. Finally, it also contains exception handling code. When compiling My MethodAsync, the compiler transforms the code in this method to a state machine structure that is capable of being suspended and resumed.
I took the preceding code, compiled it, and then reverse engineered the IL code back into C# source code. I then simplified the code and added a lot of comments to it so you can understand what the compiler is doing to make async functions work. The following is the essence of the code created by the compiler’s transformation. I show the transformed MyMethodAsync method as well as the state machine structure it now depends on.
// AsyncStateMachine attribute indicates an async method (good for tools using reflection);
// the type indicates which structure implements the state machine [DebuggerStepThrough, AsyncStateMachine(typeof(StateMachine))] private static Task<String> MyMethodAsync(Int32 argument) {
// Create state machine instance & initialize it StateMachine stateMachine = new StateMachine() {
// Create builder returning Task<String> from this stub method
// State machine accesses builder to set Task completion/exception m_builder = AsyncTaskMethodBuilder<String>.Create(),
m_state = 1, // Initialize state machine location m_argument = argument // Copy arguments to state machine fields
};
// Start executing the state machine stateMachine.m_builder.Start(ref stateMachine);
return stateMachine.m_builder.Task; // Return state machine's Task
}
// This is the state machine structure [CompilerGenerated, StructLayout(LayoutKind.Auto)] private struct StateMachine : IAsyncStateMachine {
// Fields for state machine's builder (Task) & its location public AsyncTaskMethodBuilder<String> m_builder;
public Int32 m_state;
// Argument and local variables are fields now: public Int32 m_argument, m_local, m_x;
public Type1 m_resultType1; public Type2 m_resultType2;
// There is 1 field per awaiter type.
// Only 1 of these fields is important at any time. That field refers
// to the most recently executed await that is completing asynchronously: private TaskAwaiter<Type1> m_awaiterType1;
private TaskAwaiter<Type2> m_awaiterType2;
// This is the state machine method itself void IAsyncStateMachine.MoveNext() {
String result = null; // Task's result value
// Compilerinserted try block ensures the state machine’s task completes try {
Boolean executeFinally = true; // Assume we're logically leaving the 'try' block if (m_state == 1) { // If 1st time in state machine method,
m_local = m_argument; // execute start of original method
}
// Try block that we had in our original code try {
// After the first await, we capture the result & start the 'for' loop m_resultType1 = awaiterType1.GetResult(); // Get awaiter's result
ForLoopPrologue:
m_x = 0; // 'for' loop initialization goto ForLoopBody; // Skip to 'for' loop body
ForLoopEpilog:
m_resultType2 = awaiterType2.GetResult();
m_x++; // Increment x after each loop iteration
// Fall into the 'for' loop’s body
ForLoopBody:
if (m_x < 3) { // 'for' loop test
// Call Method2Async and get its awaiter awaiterType2 = Method2Async().GetAwaiter(); if (!awaiterType2.IsCompleted) {
m_state = 1; // 'Method2Async' is completing asynchronously m_awaiterType2 = awaiterType2; // Save the awaiter for when we come back
// Tell awaiter to call MoveNext when operation completes m_builder.AwaitUnsafeOnCompleted(ref awaiterType2, ref this);
executeFinally = false; // We're not logically leaving the 'try' block return; // Thread returns to caller
}
// 'Method2Async' completed synchronously
goto ForLoopEpilog; // Completed synchronously, loop around
}
}
catch (Exception) { Console.WriteLine("Catch");
}
finally {
// Whenever a thread physically leaves a 'try', the 'finally' executes
// We only want to execute this code when the thread logically leaves the 'try' if (executeFinally) {
Console.WriteLine("Finally");
}
}
result = "Done"; // What we ultimately want to return from the async function
}
catch (Exception exception) {
// Unhandled exception: complete state machine's Task with exception m_builder.SetException(exception);
return;
}
// No exception: complete state machine's Task with result m_builder.SetResult(result);
}
}
If you spend the time to walk through the preceding code and read all the comments, I think you’ll be able to fully digest what the compiler does for you. However, there is a piece of glue that attaches the object being awaited to the state machine and I think it would be helpful if I explained how this piece of glue worked. Whenever you use the await operator in your code, the compiler takes the specified operand and attempts to call a GetAwaiter method on it. This method can be either an instance method or an extension method. The object returned from calling the GetAwaiter method is referred to as an awaiter. An awaiter is the glue I was referring to.
After the state machine obtains an awaiter, it queries its IsCompleted property. If the operation completed synchronously, true is returned and, as an optimization, the state machine simply contin- ues executing. At this point, it calls the awaiter’s GetResult method, which either throws an excep- tion if the operation failed or returns the result if the operation was successful. The state machine continues running from here to process the result.
If the operation completes asynchronously, IsCompleted returns false. In this case, the state ma- chine calls the awaiter’s OnCompleted method passing it a delegate to the state machine’s MoveNext method. And now, the state machine allows its thread to return back to where it came from so that it can execute other code. In the future, the awaiter, which wraps the underlying Task, knows when it completes and invokes the delegate causing MoveNext to execute. The fields within the state machine are used to figure out how to get to the right point in the code, giving the illusion that the method
is continuing from where it left off. At this point, the code calls the awaiter’s GetResult method and execution continues running from here to process the result.
That is how async functions work and the whole purpose is to simplify the coding effort normally involved when writing non-blocking code.