Research

Pioneering data-driven algorithms to make actionable actions in the real world.

Open Source / Software

Our group values practical and reproducible research. Below are open-source libraries, benchmarks, and tools built by the DaRL group and our collaborators, spanning reinforcement learning for transportation, large-model agents, and spatio-temporal data mining. For the full list of repositories, see our GitHub organization

CityFlow

Traffic Simulator · WWW’19

A multi-agent reinforcement-learning environment for large-scale city traffic scenarios, orders of magnitude faster than SUMO for training RL-based signal control.

💻 Repo 📘 Docs 📄 Paper

SimulatorMARLTraffic

Honor of Kings Arena

MARL Benchmark · NeurIPS’22 D&B

A competitive multi-agent reinforcement-learning environment built on Tencent’s Honor of Kings MOBA game, designed to benchmark generalization across heroes, lineups, and opponents.

💻 Repo 🌐 Site 📄 Paper

MARLGame AIBenchmark

Instructional Agents

GenAI for Education · EACL’26 Main

A multi-agent LLM system that reduces teaching-faculty workload by automating instructional design — lecture plans, slides, quizzes, and course material generation.

💻 Repo

LLM AgentsEducationEdTech

IntelliLight

Traffic Signal Control · KDD’18

A reinforcement-learning approach for intelligent traffic-light control — one of the earliest deep-RL methods for adaptive signal control on real-world road networks.

💻 Repo 📄 Paper

RLTraffic Signal

RL_Signals

Community Resource

A curated hub of papers, datasets, simulators, and tutorials covering reinforcement learning for traffic signal control — the go-to reading list for newcomers to the area.

💻 Repo 🌐 Website

RLTraffic SignalAwesome List

LibSignal

Traffic Signal Control · MLJ

A unified open library for traffic signal control with reproducible RL baselines across simulators (CityFlow, SUMO), standardized benchmarks, and cross-simulator evaluation.

💻 Repo 📘 Docs 📄 Paper

RLBenchmarkTraffic Signal

PyDimension

Scientific ML · Nature Communications’22

Dimensionless learning — data-driven discovery of dimensionless numbers and scaling laws from scarce experimental measurements, combining physics priors with sparse regression.

💻 Repo 📘 Book 📄 Paper

Scientific MLPhysicsDiscovery

PromptGAT

Sim-to-Real Transfer · AAAI’24

Prompt-to-Transfer: closing the sim-to-real gap for traffic signal control by conditioning a grounded action transformer on natural-language prompts.

💻 Repo 📄 Paper

Sim-to-RealPrompt LearningTraffic

CoMAL

Multi-Agent LLM · SDM’25

Collaborative Multi-Agent Large Language Models for mixed-autonomy traffic — coordinating CAVs and human-driven vehicles via LLM-based negotiation and planning.

💻 Repo 🌐 Demo

Multi-Agent LLMMixed-AutonomyTraffic

CityFlowER

Traffic Simulator

An efficient and realistic traffic simulator with embedded machine-learning vehicle-behavior models, bridging the gap between rule-based speed and data-driven realism.

💻 Repo 📄 Paper

SimulatorEmbedded MLTraffic

Open-TI

LLM Agent · IJMLC

Open Traffic Intelligence — an augmented-language-model agent that turns natural-language instructions into traffic analysis, simulator control, and signal-policy actions end-to-end.

💻 Repo 📄 Paper

LLM AgentTrafficTool Use

Generative AI

Projects: OpenTI, ICLR'26a, EACL'26a (Instructional Agents), EACL'26b, EACL'26c ,ACL'25a, ACL'25b, KDD'25a, KDD'25b, IJCAI'25 (DeepShade), SDM'25a, SDM'25b

Generative AI expresses the possibility of human-like AI. We investigate its potential and pitfalls.

Sim-to-real Transfer

Papers: Survey, ICLR'26b, AAAI'26, RLC'25, ICCPS'25a, AAAI'24a, AAAI'24b, ITSC'24 (SynTrac), CDC'23a,CASE'23

Training in simulation would fail to perform similarly in the real world. We investigate how to transfer from simulation to the real world.

Learning to Simulate

Papers: ECML-PKDD'24a, PADS'23, ERA'23, KDD'22, AAAI'21, ICDE'21, ECML-PKDD'20, AAAI'20 Workshop

Realistic simulators are a step closer towards policymaking for the real world. We investigate how to build realistic simulators from real-world data.

Simulator/Environment Building/Datasets

Project websites: CityFlowER, Honor of Kings (王者荣耀), LibSignal, CityFlow, Epidemic, Product Allocator

Simulators are the foundation of reinforcement learning. We built a bunch of simulators for various applications, including MOBA Games, transportation, epidemics, and product allocation.

Trustworthy Deep Learning

Papers: EACL'25b, EACL'25c, SIGKDD Explorations'25, COLM'25, KDD'25a, KDD'25c, ICML'25a, ICML'25b, AAAI'24a, AAAI'24c, ICDM'23, CDC'23a, CIKM'23, KDD'23, IJCAI'23, ERA'23, AAAI'23, IAAI'22, IJCAI'21a, IJCAI'21b, USENIX Security'21 (Adversarial Policies), NeurIPS'20 Workshop

The project investigates different aspects of trustworthy deep learning, including robust modeling for deep learning models with physics, reinforcement learning with offline data, and adversarial policy training.

Deep Reinforcement Learning

Papers: Survey (Arxiv), Survey(KDD Explorations), AAAI'24a, CDC'23a, CDC'23b, CASE'23, IJCAI'23, AAAI'23, AAAI'20, KDD'19, CIKM'19a, CIKM'19b, KDD'18

The project systematically investigates "smart" traffic light control systems using deep reinforcement learning and evaluate its effectiveness on both synthetic and real-world traffic data.

Spatio-temporal Data Mining

Papers: ICCPS'25b, ECML-PKDD'24b, ICDM'23, ERA'23a, ERA'23b, AAAI'21, NeurIPS'20 Workshop, AAAI'19, TKDD'19, WWW'19, PAKDD'18, CIKM'16

This project investigated the spatial-temporal prediction problems with applications in smart cities.

Research

Open Source / Software

CityFlow

Honor of Kings Arena

Instructional Agents

IntelliLight

RL_Signals

LibSignal

PyDimension

PromptGAT

CoMAL

CityFlowER

Open-TI

Sponsors

Generative AI

Sim-to-real Transfer

Learning to Simulate

Simulator/Environment Building/Datasets

Trustworthy Deep Learning

Deep Reinforcement Learning

Spatio-temporal Data Mining