Abstract
Ramp driving poses a big challenge to autonomous vehicles, on which there are potential traffic conflicts between vehicles. Therefore, it is necessary to study ramp scenarios for development and testing. In this paper, typical ramp scenarios are studied based on naturalistic driving data (NDD). First, three major elements are defined to describe the interaction between vehicles on the ramp, including the initial state (S), the driving action (A) and the interaction performance (P). Next, variables to characterize the A and the P are selected to be clustering features, and then 8 kinds of categories are obtained by the K⁃means clustering method based on the Calinski-Harabasz (CH) index. Then, according to the clustering results, 4 kinds of typical interaction modes are obtained by analyzing the variables above. Afterwards, by analyzing the variables that characterize the S, typical logical scenarios are extracted by the confidence ellipse. Finally, based on the logical scenarios, two concrete scenarios are selected to test and evaluate the autonomous driving system (ADS). The results show that testing with typical ramp scenarios can reveal the social cooperation capabilities of autonomous vehicles. Therefore, it is effective to generate typical ramp scenarios by clustering analysis based on NDD.
With the development of technology, intelligent vehicles have gradually had the high-level autonomous driving function. As it has the capability to predict interactive trends and implement interactive behavior
The knowledge-based and the data-based methods for generating test scenarios are the two mainstreams. The research team of Technische Universität Braunschweig used the ontology method to generate test scenarios according to the knowledge of experts and traffic law
Relevant researches on interaction modes of ramp have been conducted recently. LIU proposes a lane-change prediction model of ramp based on the BP neural networ
In this paper, three major elements are defined to describe the interaction between vehicles on the ramp, including the initial state (S), the driving action (A), and the interaction performance (P). In addition, typical interaction modes are analyzed by using the clustering method based on the variables that characterize the driving action and the interaction performance, and typical ramp scenarios are extracted based on the distribution of variables that characterize the initial state. The results show that testing with typical ramp scenarios can reveal the social cooperation capabilities of autonomous vehicles. Therefore, it is effective to generate typical ramp scenarios by clustering analysis based on NDD.
In order to study the typical interaction mode on the ramp, firstly, interaction samples are selected based on naturalistic driving data (NDD). Secondly interaction types of vehicles on the ramp are defined and analyzed statistically based on NDD. Finally, typical interaction modes are studied by clustering analysis based on the samples of typical interaction types.
The NDD is collected on a ramp of expressway G50 in Shanghai by drone. The original data is 4 hours of video captured by aerial photography. The view of NDD is shown in

Fig.1 View of NDD

Fig.2 Definition of roads and vehicles in the ramp scenario
A total of 1252 interaction samples are selected to research the interaction modes of vehicles on the ramp.
The interaction type refers to the interaction behavior between the merging vehicle and the main-road vehicle in the confluence area. According to differences in the congestion of the ramp and the main-road, driving path of the merging vehicle, driving behavior of the merging vehicle and traffic condition on the main-road, 4 kinds of interaction types are defined. Simultaneously, the 1 252 samples of different interaction types are statistically analyzed. The distribution of interaction types is shown in

Fig.3 Distribution of interaction types (N=1 252)
The interaction of the lagging vehicle on the main-road means there is only one lagging vehicle on the main-road when merging. The interaction of the leading vehicle on the main-road means there is only one leading vehicle on the main-road when merging. The other types include no interaction, illegal driving, and so on.
According to the statistical results in
Clustering variables are important for clustering analysis. Redundant or too few clustering variables will affect the clustering effec
In this paper, three major elements abbreviated as SAP are defined to describe the interaction between vehicles on the ramp, including the initial state (S), the driving action (A), and the interaction performance (P). The diagram of SAP is manifested in

Fig.4 SAP of interaction between vehicles
Studies on the prediction model of the merging behavior show that interaction modes are mainly determined by the driving behavior and the merging performance when mergin
Element | Variables | Symbol | Meaning |
---|---|---|---|
S | Merging gap | D | distance between the lagging vehicle and the leading vehicle along the direction of the main-road |
Relative speed | ΔV | speed difference between the merging vehicle and the vehicle on the main-road | |
A | Speed difference of the merging vehicle | ΔVmv | difference between the speed when merging and the speed at the S |
Speed difference of the vehicle on the main-road | ΔVmrv | difference between the speed when merging and the speed at the S | |
P | Merging Position | P | distance from the ramp to the position when merging |
THW of the vehicle on the main-road when merging | TTHW | ratio of the two-vehicle distance to the speed of the vehicle on the main-road |
Because of the large sample size of the typical interaction type, the K-means clustering method based on the CH (Calinski-Harabasz) index is used to analyze the typical interaction modes, which avoids the problem of the selection of categories.
(1) K-means clustering method. For the sample data set X={x1,x2,…,xi,…,xn} and clustering numbers K, firstly, the clustering algorithm could select K samples randomly, and these samples are regarded as the clustering centers. Then, the Euclidean distances between the remaining samples and the clustering centers are calculated, and the remaining samples are assigned to the nearest category. Finally, the clustering centers of each category are recalculated. The above steps will continue until the optimal convergence condition is reached. The convergence condition is expressed as follows:
(1) |
where: pk is the new clustering center of the k-th category; mk is the previous clustering center of the k-th category. σ is the allowable error value of the clustering center.
(2) CH index refers to the ratio of inter-class dispersion and intra-class compactness, and the function is expressed as follows:
(2) |
where: Btrace is the trace of the covariance matrix of the inter-class data; Wtrace is the trace of the covariance matrix of the intra-class data; N is the total sample number of the sample data set; k is the number of clustering categories. Larger the value of CH index, the better the clustering effect.
Rezae
The clustering results from 348 samples are shown in

Fig.5 Value of CH index
Category | Number | Proportion/% |
---|---|---|
1 | 35 | 10.06 |
2 | 13 | 3.74 |
3 | 58 | 16.67 |
4 | 46 | 13.22 |
5 | 19 | 5.46 |
6 | 44 | 12.64 |
7 | 76 | 21.84 |
8 | 57 | 16.38 |
According to the statistical results in
Index | Value | 1 | 3 | 4 | 6 | 7 | 8 |
---|---|---|---|---|---|---|---|
P | Head (0~75 m) | 1 | 56 | 0 | 34 | 39 | 49 |
Middle (75~150 m) | 27 | 2 | 34 | 10 | 37 | 8 | |
End (>150 m) | 7 | 0 | 12 | 0 | 0 | 0 | |
ΔVmrv | Acceleration (>3 m/s) | 2 | 1 | 0 | 0 | 4 | 5 |
Stability (-3~3 m/s) | 28 | 15 | 26 | 1 | 63 | 41 | |
Deceleration (<-3 m/s) | 5 | 42 | 20 | 43 | 9 | 11 | |
ΔVmv | Acceleration (>3 m/s) | 27 | 1 | 2 | 7 | 0 | 41 |
Stability (-3~3 m/s) | 8 | 55 | 44 | 37 | 76 | 16 | |
Deceleration (<-3 m/s) | 0 | 2 | 0 | 0 | 0 | 0 | |
TTHW | With risk (<1 s) | 23 | 10 | 31 | 30 | 26 | 16 |
Without risk (>1 s) | 12 | 48 | 15 | 14 | 50 | 41 |
In order to determine the interaction mode of the typical category, 4 kinds of the interaction modes are defined based on whether there is a cooperative behavior and the potential collision risk when merging. The cooperative behavior refers to the acceleration and deceleration during interaction in confluence area. The interaction modes are defined in
Merging performance | Cooperative behavior | No cooperative behavior |
---|---|---|
With potential risk | Successful cooperative lane change (SCLC) | Weak-interaction lane change (WLC) |
Without potential risk | Unsuccessful cooperative lane change (USCLC) | Forced lane change (FLC) |
In Categories No. 1 and No. 6, because TTHW is less than 1, the potential risk between the merging vehicle and the vehicle on the main road is high. In terms of the driving action, there are cooperative behaviors that increase the merging gap in both categories. Therefore, Categories No. 1 and No. 6 belong to USCLC. The difference between the two categories is that in Category No. 1, the merging vehicle accelerates to try to increase the merging gap, and in Category No. 6, the vehicle on the main road decelerates to try to increase the merging gap.
In Categories No. 3 and No. 8, because TTHW is greater than 1, the potential risk between the merging vehicle and the vehicle on the main road is low. In terms of driving action, there are cooperative behaviors that increase the merging gap in both categories. Therefore, Categories No. 3 and No. 8 belong to SCLC. The difference between the two categories is that in Category No. 3, the vehicle on the main road decelerates to increase the merging gap, and in Category No. 8, the merging vehicle accelerates to increase the merging gap.
In Category No. 4, the potential risk between the merging vehicle and the vehicle on the main road is high. Moreover, there are no cooperative behaviors that increase the merging gap. Thus, Category No. 4 belongs to FLC.
In Category No. 7, the potential risk between the merging vehicle and the vehicle on the main road is low. Moreover, there are no cooperative behaviors that increase the merging gap. Therefore, Category No. 7 belongs to WLC.
The density of scatter plot could reflect the distribution of variables in a continuous area. Therefore, the characteristics of the variables can be analyzed. In order to understand the difference between the interaction mode and the interaction mechanism, the distribution of variables that characterize the S for different interaction modes are analyzed by the density of scatter plot, and the result is shown in

Fig.6 Density of scatter plot
Compared with other interaction modes, initial relative speed is a lower(median is -2 m/s) between the merging vehicle and the vehicle on the main road in WLC, and the merging gap is also large (median is larger than 50 m). Therefore, the merging vehicle and the vehicle on the main road could drive normally without any behavior that could increase the merging gap.
Compared with the FLC and USCLC, initial relative speed is a higher (median is -5 m/s) in SCLC, but the merging gap is large (median is larger 50 m). Therefore, there is enough space for the merging vehicle and the vehicle on the main road to carry out the cooperative behavior, thereby the merging vehicle could drive without collision risk.
In USCLC and FLC, the initial relative speed is high(median is -8 m/s), and the merging gap is small (median is less than 30 m). Therefore, there is not enough space for the merging vehicle and vehicle on the main road to increase the merging gap and reduce the relative speed, thereby there is a large potential collision risk between the merging vehicle and the vehicle on the main road.
Typical ramp scenarios could be understood as the scenarios that appear frequently in the real world and could include different interaction modes. In order to extract the typical ramp scenarios, the distribution of scenario parameters for different interaction modes is analyzed by the method of confidence ellipse based on the clustering results. The confidence ellipses of different interaction modes at a confidence of 75% are shown in

Fig.7 Confidence ellipses of interactive modes at a confidence of 75%
According to
In addition to the merging gap and the relative speed, scenario parameters also include the initial speed of the merging vehicle, the initial speed of the leading vehicle on the main road, and the distance between the merging vehicle and the lagging vehicle on main road. Based on the samples distributed in the gray area in
Parameter | Symbol | Unit | Range |
---|---|---|---|
Initial speed of vehicle A | VA0 | m/s | [11.1, 20.6] |
Relative speed between vehicle A and vehicle C | ΔV | m/s | [-6.7, -1.8] |
Initial speed of vehicle B | VB0 | m/s | [11.5, 28.2] |
Merging gap | D0 | m | [26.0, 68.0] |
Distance between vehicle A and vehicle C | DL0 | m | [0.0, 69.0] |

Fig.8 Diagram of the test scenario
A hardware-in-the-loop simulation platform is constructed based on the virtual test drive (VTD). First, a ramp scene is constructed in the simulation environment, which consists of a two-lane main road and an acceleration lane of 226 m. Next, two concrete scenarios[17] are selected based on the logical scenarios in
Parameter | Symbol/Unit | Concrete scenario No.1 | Concrete scenario No.2 |
---|---|---|---|
Initial speed of vehicle A | VA0/(m/s) | 13.3 | 13.1 |
Relative speed between vehicle A and vehicle C | ΔV/(m/s) | -3.2 | -5.2 |
Initial speed of vehicle B | VB0/(m/s) | 12.8 | 13.9 |
Merging gap | D0/m | 21.0 | 86.0 |
Distance between vehicle A and vehicle C | DL0/m | 9.0 | 39.0 |
Using the data of the simulation test, the harmony with traffic of vehicle A is evaluated by the back propagation neural network (BPNN) mapping model for evaluatio
Scenario | Score of harmony with traffic |
---|---|
Concrete scenario No.1 | 2.1 |
Concrete scenario No.2 | 2.9 |
Taking concrete scenario No. 1 as an example, in

Fig.9 Test process of concrete scenario No.1
In order to analyze the difference in driving behavior between ADS and a human driver. Taking concrete scenario No. 1 as an example (see
Interaction Performance | Human driver | ADS | |
---|---|---|---|
Interactive mode | FLC | USCLC | |
Vehicle A | Performance | High | Low |
Driving action | Merging into the main road at the middle of the acceleration lane | Merging into the main road at the end of the acceleration lane | |
Vehicle C | Performance | Low | High |
Driving action | Decelerate | Lane change |
It can be seen from the above test and evaluation results that typical ramp scenarios could be used to reveal the social cooperation capabilities of autonomous vehicles. Therefore, it is effective to generate typical ramp scenarios by clustering analysis based on NDD.
In order to extract typical ramp scenarios, typical interaction modes are studied by clustering analysis based on NDD. Typical logical scenarios are extracted based on the clustering results. In addition, two concrete scenarios derived from logical scenarios are selected to test and evaluate ADS. The conclusions are listed as follows:
(1) Clustering analysis of the interaction between vehicles through major element SAP is an effective way to extract the typical interaction mode and scenarios of ramp driving based on NDD.
(2) By selecting the variables to characterize the A and the P as clustering features, including the speed differences, the merging position, and the THW, 8 kinds of categories are obtained by the K-means clustering method based on CH index. Moreover, 4 kinds of typical interaction modes distinguished by whether there are cooperative behavior and potential risk are obtained by analyzing the variables above.
(3) The variables that characterize the S are selected as scenario parameters, such as the relative speed between the merging vehicle and the main-road vehicle, the merging gap, the speed of the merging vehicle and so on. The distribution of those scenario parameters is analyzed by confidence ellipse. According to this, typical logical scenarios are extracted.
(4) The results of the ADS test and evaluate based on the concrete scenarios derived from logical scenarios show that testing with typical ramp scenarios can reveal the ADS’s capability of harmony with traffic, which can reflect the social cooperation capabilities of autonomous vehicles.
In the future, the sample data in the type of interaction with the leading vehicle on the main road will be considered, which is the second major type as shown in
References
MENG H, XING X, CHEN J, et al. Comprehensive evaluation framework of autonomous vehicles' intelligence (CEFAVI)[C]// 2019 SAECCE-ICV. Shanghai: China Society of Automotive Engineering, 2019: 64. [Baidu Scholar]
CHEN J Y, CHEN L, MENG H L, et al. Evaluation method of the harmony with traffic based on neural network[J]. Journal of Tongji University (Natural Science), 2021, 49(1): 135. [Baidu Scholar]
WANG E, SUN J, SHUN J, et al. Modeling the various merging behaviors at expressway on-ramp bottlenecks using support vector machine models[J]. Transportation Research Procedia, 2016, 25: 1327. [Baidu Scholar]
BAGSCHIK G, MENZEL T, MAURER M. Ontology based scene creation for the development of automated vehicles[C]// 2018 IEEE Intelligent Vehicles Symposium (IV). Suzhou: IEEE, 2018: 1813. [Baidu Scholar]
MENZEL T, BAGSCHIK G, MAURER M. Scenarios for development, test and validation of automated vehicles[C]// 2018 IEEE Intelligent Vehicles Symposium (IV). Suzhou: IEEE, 2018: 1821. [Baidu Scholar]
WU B, ZHU X C, SHENG J P, et al. Analysis of causation of rear-end incidents based on naturalistic driving study[J]. Journal of Tongji University (Natural Science), 2018, 46(9): 1253. [Baidu Scholar]
HU L, YI P, HUANG J, et al. A research on test scenes of two-wheeled vehicles for automatic emergency braking system based on real accident cases[J]. Automotive Engineering, 2018, 40(12): 1435. [Baidu Scholar]
LIU Z Q, WANG J Y, WANG P, et al. Study on lane change behavior on expressway on-ramp merging area based on BP neural network[J]. Journal of Highway and Transportation Research and Development, 2014, 31(9): 120. [Baidu Scholar]
KITA H. Effects of merging lane length on the merging behavior at expressway on-ramps[C]// Proceeding of the 12th International Symposium on the Theory of Traffic Flow and Transportation. Berkeley: UC Berkeley Transportation Library, 1993: 37. [Baidu Scholar]
TAN Y L, JIA H F. Vehicle interaction behaviors model based on drivers characteristics at expressway-ramp merging area[C]// International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII). Xi'an: IEEE, 2013: 371. [Baidu Scholar]
HIADS P. Modeling vehicle interactions in microscopic simulation of merging and weaving[J]. Transportation Research Part C: Emerging Technologies, 2005, 13(1): 37. [Baidu Scholar]
KONDYLI A, ELEFTERIADOU L. Modeling driver behavior at freeway-ramp merges[J]. Transportation Research Record: Journal of the Transportation Research Board, 2011, 2249(1): 29. [Baidu Scholar]
AKAGI Y, KATO R, KITAJIMA S, et al. A risk-index based sampling method to generate scenarios for the evaluation of automated driving vehicle safety[C]// 2019 IEEE Intelligent Transportation Systems Conference (ITSC). Auckland: IEEE, 2019: 667. [Baidu Scholar]
SUN J, ZHANG Y H, WANG J Y. Detection distraction behavior of drivers using naturalistic driving data[J]. China Journal of Highway and Transport, 2020, 33(9): 225. [Baidu Scholar]
WANG E G, SUN J. Exploring freeway merging behavior using dynamic bayesian network models[C]// International Conference on Transportation and Development. Pittsburgh: ASCE, 2018: 120. [Baidu Scholar]
REZAEE M R, LELIEVELDT B F, REIBER J H C. A new cluster validity index for the fuzzy C-mean[J]. Pattern Recognition Letters, 1998, 19(3/4): 237. [Baidu Scholar]
MENZEL T, BAGSCHIK G, MAURER M. Scenarios for development, test and validation of automated vehicles[C]// Proceeding of 2018 IEEE Intelligent Vehicles Symposium (IV) Changshu: IEEE, 2018:1821. [Baidu Scholar]