TY - JOUR T1 - Vectorial Feature Engineering of Temporal Variables in GRU-based Machine Learning Models for Temperature Analysis in Puno A1 - Leonel Coyla-Idme A1 - Fred Torres-Cruz A1 - Ernesto Nayer Tumi-Figueroa A1 - Elqui-Yeye Pari-Condori A1 - Leonid Aleman Gonzales A1 - Jose Pánfilo Tito Lipa JF - World Journal of Environmental Biosciences JO - World J Environ Biosci SN - 2277-8047 Y1 - 2026 VL - 15 IS - 1 DO - 10.51847/bO2tYhjWMD SP - 97 EP - 105 N2 - Accurate prediction of daily mean temperature in high-altitude environments remains challenging due to pronounced thermal variability, strong seasonal oscillations, and data limitations typical of remote meteorological stations. This study proposes a vectorial feature engineering strategy for temporal variables within a Gated Recurrent Unit (GRU) framework, applied to the Puno station (3,827 m a.s.l.) in the Peruvian Altiplano. The dataset comprises 7,279 daily observations (2003–2024), preprocessed through a four-stage pipeline: sentinel value detection, dual-consensus outlier removal (interquartile range and monthly Z-score), cubic spline imputation, and Min–Max normalization. The core contribution is the cyclic sine–cosine encoding of calendar variables (month, day), projecting discrete temporal indices onto the unit circle in ℝ². This representation removes artificial discontinuities at calendar boundaries (e.g., December–January), which persist under conventional linear scaling. The proposed architecture—GRU (64), SpatialDropout1D (0.2), GRU (32), Dense (16, ReLU), Dense (1)—is trained using 30-day sliding windows for one-step-ahead forecasting. A controlled ablation study compares the proposed approach against an identical baseline using normalized but non-cyclic temporal features. On the test set (n = 1,088; 2021–2024), cyclic encoding yields consistent improvements: MAE decreases from 0.0663 to 0.0655 (+1.17%), RMSE from 0.0836 to 0.0829 (+0.82%), MAPE from 13.20% to 12.63% (+4.32%), and R² increases from 0.6705 to 0.6758 (+0.80%). The denormalized error (0.81 °C) remains within standard measurement uncertainty, confirming the effectiveness of cyclic temporal encoding as an inductive bias without increasing model complexity. UR - https://environmentaljournals.org/article/vectorial-feature-engineering-of-temporal-variables-in-gru-based-machine-learning-models-for-tempera-zuaqnj4ytdyqtx9 ER -