Human Behavior

Nov 23, 2025 · 10 min read
Why should we study human behavior?

Picture this. It’s an average day. You wake up, brush your teeth, make coffee, and get ready for work. If you have a kid, you get them ready to go to school, and you might walk them to the bus stop down the road. You drive to work and start your computer. For lunch, you heat up your food in the microwave (or you go out to get food elsewhere). Perhaps you take a walk. On your way home, you stop by the grocery store, or maybe the gym. Once you get home, you make dinner, run a load of laundry, and relax to your favorite tv show before showering and getting ready for bed.

Humans behavior is at the heart of any city. At nearly every step throughtout the day, we use energy, whether it’s lights, driving, computers, food preparation (or storage!), or hot water for a shower. But even during a lunch-time walk, most people will bring a phone or smart watch.

Now, imagine trying to scale this up to try and understand, model, or predict the population behavior for an entire city, state, or country!

Illustration of the different stops one might take during a typical day, using icons and arrows.

Building Occupancy…

As you might imagine from this example, building occupancy significantly impacts energy use and how it varies throughout the day. It affects the timing for demand impacts, and is a significant source of uncertainty in building energy modeling. Traditionally, building energy modeling uses static occupancy schedules. Below, I will share two different approaches which I have used to generate customized, more accurate schedules.

…from Simulated Traffic

My first work on building occupancy was part of the Urban Exascale Computing Project which explored the interactions between buildings, transportation, humans, and climate.

For this project, we used simulated traffic in the greater Chicago area (34sqkm). We used population models and survey data as inputs to the traffic model to represent population distributions, and human behavior and decision-making (e.g. typical departure times for commute). We used the traffic model outputs to determine arrival/departures near buildings, and produce building occupancy schedules.

Large and complex workflow graphic that is spread over three lines of blocks. It shows the processing of 5 unique measured data sources, survey data and outputs from 2 different models, as well as derived products.
Data fusion workflow split into three different sections. 1. Fusing weather data with traffic speeds to analyze weather impact on driving behavior. 2. Producing traffic simulation inputs from a population model and commuter statistics, and fusing the resulting simulation data to road geometries. 3. Fusing of building data, traffic simulation outputs, and travel survey data to produce building occupancy schedules which are used by a building energy model.

My Contributions
  • Solicited traffic data from Illinois Department of Transportation (IDOT) for traffic models.
  • Implemented coupling between population and transportation mode choice models and traffic simulations (TRANSIMS/SUMO) to generate realistic transit scenarios for an area of 34sqm in the Chicago metropolitan area.
  • Designed and set up realistic, data-driven traffic scenarios for TRANSIMS.
  • Set up parallel TRANSIMS simulation models which ran on Titan supercomputer.
  • Implemented coupling between traffic simulation and building energy models (EnergyPlus) through occupancy.
  • Optimized the computational complexity of generating occupancy data through the use of quadtrees, a recursive partitioning method.

Map of the Chicago Loop area with small yellow dots representing agents, teal buildings, and a visual representation of the quadtree subdivision.
Hierarchical subdivision (quadtree) of the Chicago Loop reduces computational complexity.

Selected Publications

…from Traffic Sensor Measurements

This work began with the installation of large numbers of traffic sensors in Chattanooga, Tennessee, which I was familiar with due to my traffic-related work. Due to the high coverage in the downtown area, I wanted to explore whether it would be possible to get close estimates for building occupancy.

Since the sensors provide data in real-time, it would be possible to determine real-time occupancy. This could be used as an input for real-time building management systems.

My Contributions
  • Processed and managed traffic counts from over 30 IoT traffic monitoring devices.
  • Mapped the device topology to geographic locations and produced a Voronoi tesselation of the region.
  • Determine which amount of floor area of each building falls in the different Voronoi cells.
  • Determined arrivals and departures of vehicles between each pair of neighboring intersections.
  • Used the topological mapping to assign the percentage of arriving and departing traffic to 600 buildings, based on proximity and building size.

Animation showing a map of buildings, which is divided into Voronoi cells. As the view zooms into a smaller area, incoming traffic is assigned to the surrounding buildings.
Animation illustrating the different steps of the workflow.

If you’re interested in learning about my work on scaling up building energy simulation to national scale, check out my work on [automated building energy modeling]../buildings).

Media Coverage

…from Cellphone GPS Data

This work was part of a big project that studied the human dimensions of energy systems. Some of this work centered around modeling people’s willingness to participate in demand response programs from utilities. My project thrust team’s focus was to understand the impact that humans have on the energy system (e.g. increased energy use during heat waves and winter storms), and the impact of extreme weather and power outages on humans.

A visual overview of the analysis process. The top left shows renderings of temperatures in Texas before (above freezing) and during Uri (extremely cold). The right shows a time series of outage patterns in which arrows point to early outages due to frozen wind turbine blades, and the beginning of natural gas equipment freezing which led to widespread outages. At the bottom, there are two maps. One shows buildings colored by type, and one shows changes in visit patterns (green and purple). On the bottom right, there’s a bar chart which shows a few positive and a lot of negative bars, representing overall visits by building type.
In February 2021, Winter Storm Uri caused temperatures to rapidly drop up to 50℉/25℃ below typical Texas winter temperatures (see comparison on the top left), and due to the isolated nature of the Texas powergrid, there was little room for compensation for the additional load. The top right shows a heatmap comparison of power outages over time (x-axis) for different Texas counties (y-axis). The red line indicates the threshold for the 10% most affected counties (in the tool itself, hovering reveals more information about the counties and the extent of the outages).

One major part of this work was to study human travel patterns. As most trips are taken for the purpose of visiting a specific destination, we used Point of Interest (POI) visit data which was derived from cellphone GPS data. This data tells us how which places someone visited, and how long. In the example of a typical day, the “trip chain” would be home (Census block group) ➡️ office ➡️ walk (if near POI) ➡️ office ➡️ grocery store ➡️ home.

As the overall objective of the project was to study the relationship to energy, we chose Winter Storm Uri as our case study. This winter storm caused widespread power outages in Texas. This caused behavior changes as people were stuggling to keep their homes warm. While overall travel dropped, gas stations were among the places which saw increased visits as many households resorted to using generators. Another increase was seen in the visits to community centers and large event spaces which served as cold shelters.

A visual overview of the analysis process. The top left shows renderings of temperatures in Texas before (above freezing) and during Uri (extremely cold). The right shows a time series of outage patterns in which arrows point to early outages due to frozen wind turbine blades, and the beginning of natural gas equipment freezing which led to widespread outages. At the bottom, there are two maps. One shows buildings colored by type, and one shows changes in visit patterns (green and purple). On the bottom right, there’s a bar chart which shows a few positive and a lot of negative bars, representing overall visits by building type.
The tool provides navigation elements for users to select two timeframes they want to compare. In this case, we chose the 3 days with most intense outages, and an equivalent 3-day window two weeks prior, before the winter storm hit. The bottom shows buildings colored by POI type (for buildings with multiple POI, we chose the type with the highest importance – shown in the legend on the left). The map in the middle shows increases (green) and decreases (purple) in visits during the storm, compared with pre-storm conditions. The changes in visits/occupancy by POI subtype (colored by POI type) are shown on the bottom right. Large Event Spaces (which served as cold shelters) saw an increase in occupancy that’s just a little over the decrease in occupancy of residential homes, and the visits to correctional facilities dropped dramatically.

My Contributions
  • Co-led a team of 5 researchers for 1.5 years of the project.
  • Developed a HPC-based pipeline to process POI data from GPS traces.
  • Determined detailed building occupancy based on POI visits.
  • Processed, cleaned, de-duplicated, and enriched over 500GB of GPS data using parallelization (map reduce) on Kestrel supercomputer.
  • Developed a hybrid method to combine deterministic and ChatGPT-based assignment of POI types to 130k POIs based on their business names.
  • Conducted a case study analyzing changes in mobility and occupancy patterns during the 2021 Winter Storm Uri power outages in Texas.
  • Mentored a postdoctoral researcher and guided development of interactive visualization tools for the relation between power outages and human travel behavior.

A workflow showing multiple steps to get from points of interest geometries and visit data to augmented visits. The steps include a two-way mapping between POIs and buildings.
The data workflow to produce the augmented building visit data which was used in this work.

Settlement and Building Detection in Satellite Imagery

A grayscale satellite image of an urban area with building footprints highlighted in purple.
Results from the building detection workflow showing individual buildings which the model detected.

Humans generally live in communities, from small settlements to major metropolitan areas. For Western countries, we generally have a good understanding thanks to Census data. But in many countries, there is no comprehensive Census, others are developing so fast that Census data from a few years ago is already very out of date.

If a non-profit wants to deliver vaccines to everyone in an entire country, it’s important to know how many vaccines (and volunteers) they should send, and where exactly to send them.

A selfie of 7 people sitting around a table with their laptops, smiling at the camera.
Most of our project team during a GPU Hackathon.

For this project, the team developed a workflow to run building detection and settlement detection for satellite imagery at unprecedented scale, using supercomputer Titan, which ranked 4th-7th in the TOP500 at the time.

My Contributions
  • I managed a project team of 9 researchers to scale up settlement and building detection Convolutional Neural Network (CNN) workflows to full country scale.
  • Developed a data workflow from local clusters and DGX boxes to deployment of deep learners on Titan.
  • Build a portable large-scale image workflow for machine learning which is can be adapted to new problems and machines.
  • Developed and optimized GPU-based image tiling technique to integrate in deep learning workflow (pycuda).

Workflow graphic with two rows.
Overview of the entire settlement and building detection workflow. The top row shows the human structure mapping workflow. It outlines the processing steps taken as data flows through different systems. The bottom row shows the model execution workflow with traditional CPU tiling and comparison to GPU tiling. Colors indicate whether operations are done on the CPU, GPU, or both.

Other Successes
  • Detected all buildings in Yemen in under 2 hours.
  • Detected all settlements in Zambia in 3 hours 45 minutes.
  • Detected all swimming pools in Texas in 20 minutes.

That last one was mostly for fun, but was relevant for an assessment of swimming pool health based on the amount of algae detected (most satellites include wavelengths outside the visible light spectrum, like infrared, which can be used to study biomass, vegetation, and more).