Awarded

PS21196 - Bayesian Synthetic Population Algorithm Development, for National Buildings Model

Descriptions

***** THIS IS AN AWARD NOTICE, NOT A CALL FOR COMPETITION ***** This procurement is being concluded following a mini competition under the RM6018 - Crown Commercial Services Research Marketplace DPS This Invitation to Tender aims to procure, on behalf of the BEIS Secretary of State, an implemented methodology for producing synthetic population sample data from multiple overlapping data sources, relating to non-domestic buildings' energy use in the UK. The BEIS National Buildings Model (NBM) makes use of disclosive property survey data to represent the diverse building population of the UK. While data of this type is richly detailed and necessary for building physics simulation, the sensitivity and relatively small sample sizes present a dual challenge. Data protection compliance requires that the "stock" datasets derived from surveys are not published, preventing external replication of BEIS analysis even once the NBM itself is published. Simultaneously, BEIS wishes to reconcile the weighted survey data with other trusted information that has been collected on the same population. These alternative data sources are diverse, from national aggregate statistics to meter-point data collected for most individual UK properties. We propose that a synthetic dataset can resolve both issues. Synthetic data generators are algorithms for condensing the important properties of a dataset into a set of cross-correlations (a modelled distribution of traits). From this, a new "sample" can be drawn which preserves the key relationships we wish to infer from the original data, while scrambling everything else. Applied to a single dataset, this can ensure that private information is not disclosed, while maintaining the format of a detailed survey. The synthetic data concept can be extended, producing a single generating algorithm from multiple otherwise incompatible datasets. The resulting "samples" would be a population of imaginary building records which are nonetheless collectively consistent with everything we (think we) know about the true population. This project will procure expert assistance in the creation of this generating algorithm. The scope will be limited to non-domestic buildings energy use, but the approach taken is expected to be eventually extended to cover domestic buildings (which have their own unique data inputs) and potentially other domains as well. Therefore, flexibility and modularisation are important factors in the implementation. The model will be developed and implemented in an appropriate programming language (Python 3 is preferred for compatibility with the NBM, but tenderers may make a case for alternatives, such as R, if they think it necessary). Development will be version controlled using Git. The contractor will therefore need expertise in both software development and statistical inference/machine learning. Bayesian procedures have featured heavily in the exploratory work conducted so far (see below).

Timeline

Published Date :

6th Jan 2022 3 years ago

Deadline :

3rd Nov 2021 3 years ago

Tender Awarded :

1 Supplier

Awarded date :

7th Dec 2021

Contract Start :

29th Nov 2021

Contract End :

31st Mar 2022

Tender Regions

UK

Let’s Get you Started ✍

Get to see all tender details more briefly

Already have an account ?

Workflows

Status :

Awarded

Assign to :

Tender Progress :

0%

Details

Notice Type :

Open opportunity

Tender Identifier :

IT-378-246-T: 2024 - 001

TenderBase ID :

310724019

Low Value :

£100K

High Value :

£1000K

Region :

North Region

Attachments :

Buyer Information

Address :

Liverpool Merseyside , Merseyside , L13 0BQ

Website :

N/A

Procurement Contact

Name :

Tina Smith

Designation :

Chief Executive Officer

Phone :

0151 252 3243

Email :

tina.smith@shared-ed.ac.uk

Possible Competitors

1 Possible Competitors