Last updated 1 week ago • January 2, 2026
Time: One complete year of daily observations (366 days, leap year)
Location: Australia
Created by Dataset Agent
Dataset Overview
The Australian Weather Dataset provides a complete year of daily meteorological observations from the Australian Bureau of Meteorology, specifically curated for machine learning rain prediction tasks. This dataset captures the diverse climate patterns characteristic of the Australian continent, from cool winter mornings to hot summer days, making it an excellent benchmark for weather forecasting models.
The dataset contains 366 daily weather observations with 22 meteorological variables, providing a complete leap year of weather data.
Temperature ranges from -5.3°C to 35.8°C, with an average daily maximum of 20.6°C and average minimum of 12.2°C.
Total annual rainfall recorded is 522.8 mm, with rain occurring on 18% of days (66 days flagged as rainy, meaning rainfall above the 1 mm threshold).
Data Preview
Below is a sample of the dataset showing key meteorological variables. The data includes both current-day observations and the target variable for next-day rain prediction:
Sample Weather Observations (5 rows)
| Min Temp (°C) | Max Temp (°C) | Rainfall (mm) | Wind Gust Dir | Wind Gust Speed (km/h) | Humidity 9am (%) | Humidity 3pm (%) | Rain Today | Rain Tomorrow |
|---|---|---|---|---|---|---|---|---|
| 8 | 24.3 | 0 | NW | 30 | 68 | 29 | false | true |
| 14 | 26.9 | 3.6 | ENE | 39 | 80 | 36 | true | true |
| 13.7 | 23.4 | 3.6 | NW | 85 | 82 | 69 | true | true |
| 13.3 | 15.5 | 39.8 | NW | 54 | 62 | 56 | true | true |
| 7.6 | 16.1 | 2.8 | SSE | 50 | 68 | 49 | true | false |
Climate Patterns and Seasonal Insights
Analysis reveals distinct meteorological patterns characteristic of Australian climate. The data shows clear relationships between atmospheric conditions and rainfall occurrence, with humidity and pressure serving as strong predictive indicators.
The temperature distribution shows 49% of days fall in the mild range (15-25°C), while cool or hot days (below 15°C or above 30°C) account for 35% of observations. This distribution is typical of temperate Australian regions.
Heavy rainfall events (>20mm) are rare, occurring only 3 days throughout the year, while 72% of days recorded no measurable precipitation.
Wind Pattern Analysis
Wind patterns provide crucial predictive signals for weather forecasting. The dataset captures 16 compass directions for wind gusts, with northwesterly winds being the most prevalent.
Average wind gust speed is 39.8 km/h, with maximum recorded gusts reaching 98 km/h. Northwesterly winds (NW, NNW, WNW) account for 42% of all observations.
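Compass labels such as NW or ENE are text, so a model cannot use them directly. The sketch below shows one common way to turn the 16-point compass rose into numbers: map each label to a bearing in 22.5° steps, then take its sine and cosine so that neighbouring directions (for example NNW and N) stay close together. The function names are illustrative and not part of the dataset, and the sample values are the WindGustDir entries from the five preview rows.

```python
import math
from typing import Tuple

# The 16-point compass rose in clockwise order, starting at north.
COMPASS_POINTS = [
    "N", "NNE", "NE", "ENE", "E", "ESE", "SE", "SSE",
    "S", "SSW", "SW", "WSW", "W", "WNW", "NW", "NNW",
]
DEGREES_PER_POINT = 360 / len(COMPASS_POINTS)  # 22.5 degrees per step


def direction_to_degrees(direction: str) -> float:
    """Bearing in degrees clockwise from north for a compass label."""
    return COMPASS_POINTS.index(direction) * DEGREES_PER_POINT


def encode_direction(direction: str) -> Tuple[float, float]:
    """Cyclical (sin, cos) encoding of a compass label."""
    angle = math.radians(direction_to_degrees(direction))
    return math.sin(angle), math.cos(angle)


def is_northwesterly(direction: str) -> bool:
    """True for the NW sector as grouped in the text (NW, NNW, WNW)."""
    return direction in {"NW", "NNW", "WNW"}


# WindGustDir values from the five preview rows.
sample_gust_dirs = ["NW", "ENE", "NW", "NW", "SSE"]
for label in sample_gust_dirs:
    sin_part, cos_part = encode_direction(label)
    print(label, direction_to_degrees(label), round(sin_part, 3),
          round(cos_part, 3), is_northwesterly(label))
```

One-hot encoding the 16 labels is an equally reasonable choice; the cyclical form simply keeps the feature count at two per wind column.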
Rain Prediction Target Analysis
The primary ML target variable RainTomorrow indicates whether rain occurred the following day. The dataset also includes RISK_MM, the amount of rain recorded on the following day, enabling both classification and regression tasks.
Conditional rain frequencies show how much today's rain shifts the odds of rain tomorrow:
- When it rains today: 32% probability of rain tomorrow (21 out of 66 rainy days)
- When it doesn't rain today: 15% probability of rain tomorrow (45 out of 300 dry days)
- Overall class distribution: 18% positive (rain) vs 82% negative (no rain) - moderate imbalance
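The percentages in the list above follow directly from the quoted counts. The short calculation below reproduces them and adds two derived figures that are not stated in the original: the ratio between the two conditional probabilities, and the accuracy of a trivial model that always predicts "no rain". Treat it as an arithmetic check, not as part of the dataset's own analysis.

```python
# Counts quoted in the list above (366 days in total).
rainy_days = 66        # days where RainToday is true
rain_after_rain = 21   # rainy days followed by a rainy day
dry_days = 300         # days where RainToday is false
rain_after_dry = 45    # dry days followed by a rainy day

p_rain_given_rain = rain_after_rain / rainy_days
p_rain_given_dry = rain_after_dry / dry_days
base_rate = (rain_after_rain + rain_after_dry) / (rainy_days + dry_days)

# Derived figures (not stated in the source text).
relative_risk = p_rain_given_rain / p_rain_given_dry
majority_baseline_accuracy = 1 - base_rate

print(f"P(rain tomorrow | rain today)    = {p_rain_given_rain:.1%}")  # 31.8%
print(f"P(rain tomorrow | no rain today) = {p_rain_given_dry:.1%}")   # 15.0%
print(f"Overall share of rain-tomorrow   = {base_rate:.1%}")          # 18.0%
print(f"Relative risk                    = {relative_risk:.2f}")      # 2.12
print(f"Always-predict-dry accuracy      = {majority_baseline_accuracy:.1%}")  # 82.0%
```

Because always predicting "no rain" is already right on about 82% of days, plain accuracy is a weak yardstick here; recall or precision on the rain class is more informative.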
Atmospheric Measurements
The dataset captures atmospheric conditions at two standardized times daily (9am and 3pm), following Bureau of Meteorology observation protocols. This dual-reading approach reveals diurnal patterns critical for accurate forecasting.
Morning humidity averages 72% while afternoon readings drop to 44%, a 28 percentage point decrease reflecting typical evaporation patterns. Atmospheric pressure remains stable: 1019.7 hPa (9am) to 1016.8 hPa (3pm).
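The 9am and 3pm readings can be combined into per-day change features. The sketch below computes the afternoon-minus-morning difference for humidity and pressure using the five preview rows; the helper name `diurnal_change` is an illustrative choice, and whether such deltas actually improve a model on this data is an assumption to verify.

```python
from typing import List, Sequence


def diurnal_change(morning: Sequence[float], afternoon: Sequence[float]) -> List[float]:
    """Afternoon minus morning reading for each day, rounded to one decimal."""
    return [round(pm - am, 1) for am, pm in zip(morning, afternoon)]


# Values from the five preview rows.
humidity_9am = [68, 80, 82, 62, 68]
humidity_3pm = [29, 36, 69, 56, 49]
pressure_9am = [1019.7, 1012.4, 1009.5, 1005.5, 1018.3]
pressure_3pm = [1015.0, 1008.4, 1007.2, 1007.0, 1018.5]

humidity_change = diurnal_change(humidity_9am, humidity_3pm)
pressure_change = diurnal_change(pressure_9am, pressure_3pm)

print("Humidity change (percentage points):", humidity_change)
print("Pressure change (hPa):", pressure_change)
```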
Data Quality Summary
Understanding data quality is essential for building reliable models. This dataset is generally complete with a few documented exceptions:
98.4% Complete Records: The Sunshine column contains 6 missing values (1.6%) represented as 'NA' strings. All other columns have complete data for all 366 observations.
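Because missing readings are stored as the literal text 'NA', a plain numeric conversion would fail on those rows. The sketch below shows one way to handle this outside the database: map 'NA' to `None` and convert everything else to `float`. The helper names are hypothetical, and the sample list mixes Sunshine values from the preview with an 'NA' entry added purely for illustration.

```python
from typing import List, Optional


def parse_measurement(raw: str) -> Optional[float]:
    """Convert a text-encoded reading to float, mapping 'NA' to None."""
    text = raw.strip()
    if text == "NA":
        return None
    return float(text)


def completeness(values: List[Optional[float]]) -> float:
    """Share of entries that are present (not None)."""
    if not values:
        raise ValueError("cannot compute completeness of an empty column")
    present = sum(value is not None for value in values)
    return present / len(values)


# Illustrative input: three Sunshine values from the preview plus one 'NA'.
raw_sunshine = ["6.3", "9.7", "NA", "3.3"]
sunshine = [parse_measurement(value) for value in raw_sunshine]
print(sunshine)                # [6.3, 9.7, None, 3.3]
print(completeness(sunshine))  # 0.75

# Check of the figure quoted above: 6 missing values out of 366 rows.
print(f"{(366 - 6) / 366:.1%}")  # 98.4%
```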
Type Conversion Required: WindGustSpeed, WindSpeed9am, and Sunshine are stored as VARCHAR (text), as listed in the Data Dictionary. Convert them to numeric types before mathematical operations:
CAST(WindGustSpeed AS FLOAT)
Comparison with Alternative Datasets
How does this dataset compare to other weather data sources?
Methodology and Data Collection
This dataset originates from the Australian Bureau of Meteorology's network of weather stations. Observations follow World Meteorological Organization (WMO) standards for consistency and reliability. Key methodological notes:
- RainToday/RainTomorrow: Binary flags set to TRUE when that day's rainfall exceeds the 1 mm threshold
- RISK_MM: Continuous target variable representing actual next-day rainfall amount
- Dual observations: 9am and 3pm readings capture diurnal variation patterns
- Wind directions: 16-point compass rose (N, NNE, NE, ENE, E, etc.)
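The first two notes above can be expressed as a small derivation: the rain flag is a threshold test on rainfall, and RISK_MM is the next day's rainfall. The sketch below applies both rules to the Rainfall values of the five preview rows, which appear to be consecutive days because each row's RISK_MM matches the next row's Rainfall. The function names are illustrative, not part of the dataset.

```python
from typing import List, Tuple

RAIN_THRESHOLD_MM = 1.0  # threshold quoted in the notes above


def rain_flag(rainfall_mm: float) -> bool:
    """True when rainfall exceeds the 1 mm threshold."""
    return rainfall_mm > RAIN_THRESHOLD_MM


def derive_targets(rainfall: List[float]) -> List[Tuple[float, bool]]:
    """For each day except the last, return (RISK_MM, RainTomorrow)
    computed from the following day's rainfall."""
    return [(next_mm, rain_flag(next_mm)) for next_mm in rainfall[1:]]


# Rainfall (mm) from the five preview rows, assumed to be consecutive days.
rainfall = [0.0, 3.6, 3.6, 39.8, 2.8]

rain_today = [rain_flag(mm) for mm in rainfall]
targets = derive_targets(rainfall)

print(rain_today)  # [False, True, True, True, True]
print(targets)     # [(3.6, True), (3.6, True), (39.8, True), (2.8, True)]
```

If this relationship holds across the full table, RISK_MM fully determines RainTomorrow, so it would need to be excluded from the feature set when training a classifier.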
Related Datasets
For extended analysis, consider combining this weather data with complementary datasets:
- Air Quality Data: Correlate weather patterns with pollution levels (PM2.5, ozone)
- Solar Irradiance Data: Combine sunshine hours with solar radiation measurements
- Historical Climate Records: Extend temporal coverage for trend analysis
- Agricultural Yield Data: Model weather impact on crop production
Table Overview
weather
Data Preview
| MinTemp | MaxTemp | Rainfall | Evaporation | Sunshine | WindGustDir | WindGustSpeed | WindDir9am | WindDir3pm | WindSpeed9am | WindSpeed3pm | Humidity9am | Humidity3pm | Pressure9am | Pressure3pm | Cloud9am | Cloud3pm | Temp9am | Temp3pm | RainToday | RISK_MM | RainTomorrow |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8 | 24.3 | 0 | 3.4 | 6.3 | NW | 30 | SW | NW | 6 | 20 | 68 | 29 | 1019.7 | 1015 | 7 | 7 | 14.4 | 23.6 | false | 3.6 | true |
| 14 | 26.9 | 3.6 | 4.4 | 9.7 | ENE | 39 | E | W | 4 | 17 | 80 | 36 | 1012.4 | 1008.4 | 5 | 3 | 17.5 | 25.7 | true | 3.6 | true |
| 13.7 | 23.4 | 3.6 | 5.8 | 3.3 | NW | 85 | N | NNE | 6 | 6 | 82 | 69 | 1009.5 | 1007.2 | 8 | 7 | 15.4 | 20.2 | true | 39.8 | true |
| 13.3 | 15.5 | 39.8 | 7.2 | 9.1 | NW | 54 | WNW | W | 30 | 24 | 62 | 56 | 1005.5 | 1007 | 2 | 7 | 13.5 | 14.1 | true | 2.8 | true |
| 7.6 | 16.1 | 2.8 | 5.6 | 10.6 | SSE | 50 | SSE | ESE | 20 | 28 | 68 | 49 | 1018.3 | 1018.5 | 7 | 7 | 11.1 | 15.4 | true | 0 | false |
Showing 5 of 366 rows
Data Profile
- 366 rows
- 22 columns
- 100% complete
- 393.2 KB estimated size
Column Types
14 Numeric, 6 Text, 2 Boolean
High-Cardinality Columns
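Columns with many unique values. The four listed here are continuous numeric measurements, so they are better treated as numeric features than as identifiers or categorical features.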
- Temp3pm (200 unique values)
- Pressure3pm (193 unique values)
- Pressure9am (190 unique values)
- MaxTemp (187 unique values)
Data Dictionary
weather
| Column | Type | Example | Missing Values |
|---|---|---|---|
| MinTemp | numeric | 8, 14 | 0 |
| MaxTemp | numeric | 24.3, 26.9 | 0 |
| Rainfall | numeric | 0, 3.6 | 0 |
| Evaporation | numeric | 3.4, 4.4 | 0 |
| Sunshine | string | "6.3", "9.7" | 0 |
| WindGustDir | string | "NW", "ENE" | 0 |
| WindGustSpeed | string | "30", "39" | 0 |
| WindDir9am | string | "SW", "E" | 0 |
| WindDir3pm | string | "NW", "W" | 0 |
| WindSpeed9am | string | "6", "4" | 0 |
| WindSpeed3pm | numeric | 20, 17 | 0 |
| Humidity9am | numeric | 68, 80 | 0 |
| Humidity3pm | numeric | 29, 36 | 0 |
| Pressure9am | numeric | 1019.7, 1012.4 | 0 |
| Pressure3pm | numeric | 1015, 1008.4 | 0 |
| Cloud9am | numeric | 7, 5 | 0 |
| Cloud3pm | numeric | 7, 3 | 0 |
| Temp9am | numeric | 14.4, 17.5 | 0 |
| Temp3pm | numeric | 23.6, 25.7 | 0 |
| RainToday | boolean | false, true | 0 |
| RISK_MM | numeric | 3.6, 3.6 | 0 |
| RainTomorrow | boolean | true, true | 0 |