Summary and final thoughts

Throughout this lecture, we have examined the small data problem in remote sensing for agriculture, a challenge that continues to shape the design and deployment of deep learning models in real-world environmental contexts. While the volume of remote sensing data has grown exponentially in recent years, the availability of accurate, high-resolution, and representative ground truth data has not kept pace. As we discussed, this imbalance creates a small data regime, where the limitations of training data restrict model development, generalisation, and reliability. This is particularly problematic in agriculture, where spatial and temporal variability, field-level decision-making, and policy implications demand models that are both robust and interpretable.

The first part of the lecture introduced the small data problem and unpacked its causes, ranging from the high cost of field data collection to issues of spatial imbalance, temporal mismatch, and label noise. These limitations are not just technical hurdles; they reflect broader structural, geographic, and institutional constraints that shape the data landscape. Understanding these constraints is essential for designing realistic and context-appropriate solutions.

We then turned our attention to the unique challenges posed by limited ground truth data in socio-environmental applications more broadly. In agricultural systems, the variability of ecosystems, crops, and land management practices introduces complexity that cannot be fully captured by small, localized datasets. Moreover, socio-political barriers such as data access restrictions and privacy regulations further limit what data can be used, shared, or generalized. These challenges underscore the need for flexible, data-efficient modeling strategies that go beyond conventional supervised learning paradigms. The core of the lecture focused on ten emerging deep learning techniques designed to address the small data problem. These included transfer learning, self-supervised learning, semi-supervised learning, few-shot and zero-shot learning, active learning, weakly supervised learning, multitask learning, process-aware learning, and ensemble learning. Each of these techniques brings a different advantage. Some enable models to learn from unlabeled or noisy data. Others reduce the number of labeled examples required or incorporate external knowledge and domain constraints. Collectively, these techniques form a toolkit for data-efficient modeling that is both powerful and adaptable to different remote sensing tasks.

We then explored how these methods are applied in practice. In crop classification, for example, transfer learning and self-supervised learning have become essential for improving model accuracy when only limited field labels are available. Semi-supervised learning and active learning allow models to make better use of the unlabeled imagery that dominates remote sensing archives. Few-shot and zero-shot learning offer solutions for dealing with rare or emerging crop types, while multitask and process-aware learning strengthen model robustness by incorporating biological or ecological knowledge. Ensemble methods provide additional reliability and stability, particularly in uncertain or variable conditions. These techniques are not isolated; they are increasingly being used in combination to address complex, real-world modeling needs.

We also emphasised that model evaluation is as important as model development. Without a robust evaluation framework, even the most sophisticated models can produce misleading results. Spatial k-fold cross-validation offers a reliable way to assess model performance by separating training and test data geographically, avoiding artificial inflation of accuracy due to spatial autocorrelation. Especially in small data settings, this form of validation is essential to ensure that models will generalise to new areas and conditions.

Taken together, these insights point toward a future where deep learning in remote sensing is both more inclusive and more adaptable. Rather than requiring massive labeled datasets, we now have the tools to train and evaluate models with limited data, improving accessibility for regions and communities that were previously excluded from AI-driven agriculture. This shift supports broader goals of global sustainability, food security, and equitable access to digital technologies in agricultural systems.

As we conclude this lecture, it is important to recognize that addressing the small data problem is not just about applying the right algorithm. It requires a holistic understanding of the data environment, the goals of the analysis, the resources available, and the social and ecological context in which the model will be used. Deep learning models must be designed not only to perform well technically, but also to support transparent, ethical, and context-sensitive decision-making. This lecture has provided an overview of the strategies available to meet that challenge. In the following lectures, we will continue to deepen our understanding by exploring specific techniques in more detail, beginning with self-supervised learning and its role in low-label agricultural remote sensing.

References:

A. Safonova, G. Ghazaryan, S. Stiller, M. Main-Knorn, C. Nendel, and M. Ryo, “Ten deep learning techniques to address small data problems with remote sensing,” International Journal of Applied Earth Observation and Geoinformation, vol. 125, p. 103569, 2023.
M. Ryo et al., “Deep learning for sustainable agriculture needs ecology and human involvement,” Journal of Sustainable Agriculture and Environment, vol. 2, no. 1, pp. 40–44, 2023, doi: 10.1002/sae2.12036.
Song, L. et al. (2022). Self-Supervised Assisted Semi-Supervised Residual Network for Hyperspectral Image Classification. Remote Sensing, 14(13): 2997.
Liu, B. et al. (2023). Deep Contrastive Learning Network for Small Sample Hyperspectral Image Classification. Remote Sensing.