Research

Pioneering data-driven algorithms to make actionable actions in the real world.


Open Source / Software

Our group values practical and reproducible research. Below are open-source libraries, benchmarks, and tools built by the DaRL group and our collaborators, spanning reinforcement learning for transportation, large-model agents, and spatio-temporal data mining. For the full list of repositories, see our GitHub organization

CityFlow

Traffic Simulator ยท WWW’19

A multi-agent reinforcement-learning environment for large-scale city traffic scenarios, orders of magnitude faster than SUMO for training RL-based signal control.

SimulatorMARLTraffic

Honor of Kings Arena

MARL Benchmark ยท NeurIPS’22 D&B

A competitive multi-agent reinforcement-learning environment built on Tencent’s Honor of Kings MOBA game, designed to benchmark generalization across heroes, lineups, and opponents.

MARLGame AIBenchmark

Instructional Agents

GenAI for Education ยท EACL’26 Main

A multi-agent LLM system that reduces teaching-faculty workload by automating instructional design โ€” lecture plans, slides, quizzes, and course material generation.

LLM AgentsEducationEdTech

IntelliLight

Traffic Signal Control ยท KDD’18

A reinforcement-learning approach for intelligent traffic-light control โ€” one of the earliest deep-RL methods for adaptive signal control on real-world road networks.

RLTraffic Signal

RL_Signals

Community Resource

A curated hub of papers, datasets, simulators, and tutorials covering reinforcement learning for traffic signal control โ€” the go-to reading list for newcomers to the area.

RLTraffic SignalAwesome List

LibSignal

Traffic Signal Control ยท MLJ

A unified open library for traffic signal control with reproducible RL baselines across simulators (CityFlow, SUMO), standardized benchmarks, and cross-simulator evaluation.

RLBenchmarkTraffic Signal

PyDimension

Scientific ML ยท Nature Communications’22

Dimensionless learning โ€” data-driven discovery of dimensionless numbers and scaling laws from scarce experimental measurements, combining physics priors with sparse regression.

Scientific MLPhysicsDiscovery

PromptGAT

Sim-to-Real Transfer ยท AAAI’24

Prompt-to-Transfer: closing the sim-to-real gap for traffic signal control by conditioning a grounded action transformer on natural-language prompts.

Sim-to-RealPrompt LearningTraffic

CoMAL

Multi-Agent LLM ยท SDM’25

Collaborative Multi-Agent Large Language Models for mixed-autonomy traffic โ€” coordinating CAVs and human-driven vehicles via LLM-based negotiation and planning.

Multi-Agent LLMMixed-AutonomyTraffic

CityFlowER

Traffic Simulator

An efficient and realistic traffic simulator with embedded machine-learning vehicle-behavior models, bridging the gap between rule-based speed and data-driven realism.

SimulatorEmbedded MLTraffic

Open-TI

LLM Agent ยท IJMLC

Open Traffic Intelligence โ€” an augmented-language-model agent that turns natural-language instructions into traffic analysis, simulator control, and signal-policy actions end-to-end.

LLM AgentTrafficTool Use

Sponsors


Generative AI

Projects: OpenTI, ICLR'26a, EACL'26a (Instructional Agents), EACL'26b, EACL'26c ,ACL'25a, ACL'25b, KDD'25a, KDD'25b, IJCAI'25 (DeepShade), SDM'25a, SDM'25b

Generative AI expresses the possibility of human-like AI. We investigate its potential and pitfalls.

Sim-to-real Transfer

Papers: Survey, ICLR'26b, AAAI'26, RLC'25, ICCPS'25a, AAAI'24aAAAI'24b, ITSC'24 (SynTrac), CDC'23a,CASE'23

Training in simulation would fail to perform similarly in the real world. We investigate how to transfer from simulation to the real world.

Learning to Simulate

Papers: ECML-PKDD'24aPADS'23ERA'23KDD'22AAAI'21ICDE'21ECML-PKDD'20AAAI'20 Workshop

Realistic simulators are a step closer towards policymaking for the real world. We investigate how to build realistic simulators from real-world data.

Simulator/Environment Building/Datasets

Project websites: CityFlowER, Honor of Kings (็Ž‹่€…่ฃ่€€)LibSignalCityFlowEpidemicProduct Allocator

Simulators are the foundation of reinforcement learning. We built a bunch of simulators for various applications, including MOBA Games, transportation, epidemics, and product allocation.

Trustworthy Deep Learning

Papers: EACL'25b, EACL'25c, SIGKDD Explorations'25, COLM'25, KDD'25a, KDD'25c, ICML'25a, ICML'25b, AAAI'24aAAAI'24cICDM'23CDC'23aCIKM'23KDD'23IJCAI'23ERA'23AAAI'23IAAI'22IJCAI'21aIJCAI'21bUSENIX Security'21 (Adversarial Policies)NeurIPS'20 Workshop

The project investigates different aspects of trustworthy deep learning, including robust modeling for deep learning models with physics, reinforcement learning with offline data, and adversarial policy training.

Deep Reinforcement Learning

Papers: Survey (Arxiv)Survey(KDD Explorations)AAAI'24aCDC'23aCDC'23bCASE'23IJCAI'23AAAI'23AAAI'20KDD'19CIKM'19aCIKM'19bKDD'18

The project systematically investigates "smart" traffic light control systems using deep reinforcement learning and evaluate its effectiveness on both synthetic and real-world traffic data.

Spatio-temporal Data Mining

Papers: ICCPS'25b, ECML-PKDD'24b, ICDM'23ERA'23a, ERA'23bAAAI'21NeurIPS'20 WorkshopAAAI'19TKDD'19WWW'19PAKDD'18CIKM'16

This project investigated the spatial-temporal prediction problems with applications in smart cities.