Methodological remarks
The Használtautó.hu - HCSO experimental statistics were created in the framework of a cooperation agreement with Használtautó.hu Ltd. The database under study is the total supply data set of car advertisements managed by Használtautó.hu, for the 6th, 15th and 24th of each month. With the exception of Figure 4, which shows the variation over time, duplicates have been filtered out to produce statistics for the whole month. If an advertisement has a changed asking price within the month, the price in the statistics is the average of the three days of the month.
Cases where a statistically relevant data gap or data error occurred were removed from the dataset. In addition, vehicles less than one year old, vehicles with 0 kilometres and vehicles less than HUF 50,000 were filtered out.
The fuel types distinguished by Használtautó.hu were classified in simplified categories. The table below shows the matching:
Használtautó.hu category
Categories for the experimental statistics publication
Diesel
Diesel
Petrol
Petrol
Petrol/gas
Other
Diesel/gas
Other
Hybrid (petrol)
Hybrid
Hybrid (diesel)
Hybrid
Hybrid
Hybrid
Electric
Electric
LPG
Other
CNG
Other
Hydrogen/electric
Other
Ethanol
Other
Gas
Other
Biodiesel
Other
Other
Other
Empty
Empty
The components of used car prices were examined using a regression model. This procedure quantifies partial effects, assuming that all other factors remain unchanged. The natural logarithm of the asking price of motor vehicles was used as the dependent variable in the model, which was explained by the following independent variables:
-
the age of the vehicle in years, interactions with the five fuel types (petrol, diesel, hybrid, electric, and other)
-
mileage in 1,000 km
-
cylinder capacity in 100 cm3-ben
-
power in kilowatts
-
condition of the car as dummy variables (excellent, normal, well-maintained, undamaged, like new, damaged/faulty)
-
brand groups
The brands were grouped using Ward clustering, which created eight distinct categories based on the following variables: asking price, age, mileage, cylinder capacity, and power. Peripheral brands with fewer than 10 cases, which accounted for approximately 0.1% of the data set, were excluded from the process.
In the case of the regression model, the heteroscedasticity tests and normality tests showed satisfactory results.
Methodological remarks
The Használtautó.hu - HCSO experimental statistics were created in the framework of a cooperation agreement with Használtautó.hu Ltd. The database under study is the total supply data set of car advertisements managed by Használtautó.hu, for the 6th, 15th and 24th of each month. With the exception of Figure 4, which shows the variation over time, duplicates have been filtered out to produce statistics for the whole month. If an advertisement has a changed asking price within the month, the price in the statistics is the average of the three days of the month.
Cases where a statistically relevant data gap or data error occurred were removed from the dataset. In addition, vehicles less than one year old, vehicles with 0 kilometres and vehicles less than HUF 50,000 were filtered out.
The fuel types distinguished by Használtautó.hu were classified in simplified categories. The table below shows the matching:
Használtautó.hu category | Categories for the experimental statistics publication |
---|---|
Diesel | Diesel |
Petrol | Petrol |
Petrol/gas | Other |
Diesel/gas | Other |
Hybrid (petrol) | Hybrid |
Hybrid (diesel) | Hybrid |
Hybrid | Hybrid |
Electric | Electric |
LPG | Other |
CNG | Other |
Hydrogen/electric | Other |
Ethanol | Other |
Gas | Other |
Biodiesel | Other |
Other | Other |
Empty | Empty |
The components of used car prices were examined using a regression model. This procedure quantifies partial effects, assuming that all other factors remain unchanged. The natural logarithm of the asking price of motor vehicles was used as the dependent variable in the model, which was explained by the following independent variables:
-
the age of the vehicle in years, interactions with the five fuel types (petrol, diesel, hybrid, electric, and other)
-
mileage in 1,000 km
-
cylinder capacity in 100 cm3-ben
-
power in kilowatts
-
condition of the car as dummy variables (excellent, normal, well-maintained, undamaged, like new, damaged/faulty)
-
brand groups
The brands were grouped using Ward clustering, which created eight distinct categories based on the following variables: asking price, age, mileage, cylinder capacity, and power. Peripheral brands with fewer than 10 cases, which accounted for approximately 0.1% of the data set, were excluded from the process.
In the case of the regression model, the heteroscedasticity tests and normality tests showed satisfactory results.