This is an outdated version published on 25-07-2025. Read the
most recent version.
Research Articles
Early Access
Evaluating the effectiveness of principal component regression vs. multiple linear regression for black gram cultivation in Tamil Nadu
Department of Physical Sciences & IT, Agricultural Engineering College and Research Institute, Coimbatore 641 003, Tamil Nadu, India
Centre for Agricultural and Rural Development Studies, Department of Agricultural Economics, Tamil Nadu Agricultural University, Coimbatore 641 003, Tamil Nadu, India
Department of Physical Sciences & IT, Agricultural Engineering College and Research Institute, Coimbatore 641 003, Tamil Nadu, India
Department of Physical Sciences & IT, Agricultural Engineering College and Research Institute, Coimbatore 641 003, Tamil Nadu, India
Department of Physical Sciences & IT, Agricultural Engineering College and Research Institute, Coimbatore 641 003, Tamil Nadu, India
Abstract
This study examines the comparison between Multiple Linear Regression and Principal Component Regression for black gram cultivation in Tamil Nadu. This research addresses the problem of accurately modeling and predicting the factors that influence the yield and productivity of black gram cultivation in Tamil Nadu. The challenge lies in identifying which statistical technique, Principal Component Regression (PCR) or Multiple Linear Regression (MLR), is more effective in capturing the complex relationships between various environmental, economic and agricultural variables that affect black gram production. Secondary data were collected from 1999 to 2022 (23 years). Considering yield as a dependent variable and the independent variables are Seed, Fertilizer, Manure, Human labour and Animal labour. According to Multiple Linear Regression, the regression coefficients of fertilizer and human labour significantly influence the yield. The coefficients of Fertilizer and Human labour are found to be 0.049 and 0.07 respectively. According to Principal Component Analysis, the 2 principal components are chosen because the eigenvalue is more than 1.0. These 2 components, PC1 and PC2, cover 36 % and 34 % respectively. The loadings revealed that fertilizer, manure and animal labour significantly contributed to PC1 and Seed and human labour contributed significantly in PC2. The Multiple Linear Regression and Principal Component regression are compared using R square, Adjusted R Square, Root Mean Square, Mean Absolute Error and Mean Absolute Percentage Error. The adjusted R square reveals that Principal Component Regression is better than Multiple Linear Regression. The lowest value of Root Mean Square Error, Mean Absolute Error and Mean Absolute Percentage Error shows the best model among the 2 models. The error is lower for Principal Component Regression compared to Regression. PC1 captures the relationship between fertilizer, manure and animal labor, representing an "input utilization efficiency" dimension. PC2 reflects the trade-off between human labor and seed usage, defining a spectrum between "labor-intensive" and "seed-reliant" farming strategies.
References
- 1. Tamil Nadu Agricultural University. Plant growth and development. Coimbatore: Tamil Nadu Agricultural University; 2023.
- 2. Ministry of Agriculture & Farmers Welfare. Black gram cultivation practices. New Delhi: Government of India; 2023.
- 3. Food and Agriculture Organization. Black gram production and consumption trends in India. Rome: FAO; 2023.
- 4. India Meteorological Department. Impact of weather variability on pulse production in India (IMD Report No. 5678). New Delhi: IMD; 2023.
- 5. Directorate of Economics and Statistics. Agricultural production trends in India: 2023-24 outlook. New Delhi: Ministry of Agriculture & Farmers Welfare; 2022.
- 6. United States Department of Agriculture. Folic acid and maternal health: a nutritional overview. Washington (DC): USDA; 2023.
- 7. Prioty JK, Rahman KS, Miah MAM. Growth and instability analysis of black gram (Vigna mungo L.) in Bangladesh. Bangladesh J Agri. 2023;48(2):30–8. https://doi.org/10.3329/bjagri.v48i2.70156
- 8. Kumar KM, Satyanarayana PV, Babu PU, Sreenivas G, Manojkumar D. Principal component analysis for yield in black gram genotypes (Vigna mungo L. Hepper) under rice fallow pulse cropping system. Biol Forum. 2023;15(3):930–93.
- 9. Montgomery DC, Peck EA, Vining GG. Introduction to linear regression analysis. 5th ed. Hoboken (NJ): Wiley; 2012.
- 10. Kutner MH, Nachtsheim CJ, Neter J. Applied linear regression models. 4th ed. New York: McGraw-Hill/Irwin; 2004.
- 11. Jolliffe IT. Principal component analysis. 2nd ed. New York: Springer; 2002. https://doi.org/10.1007/b98835
- 12. Abdi H, Williams LJ. Principal component analysis. Wiley Interdiscip Rev Comput Stat. 2010;2(4):433–59. https://doi.org/10.1002/wics.101
- 13. Mohanlal VA, Saravanan K, Sabesan T. Application of principal component analysis (PCA) for black gram (Vigna mungo L.) germplasm evaluation under normal and water-stressed conditions. Legume Res. 2023;46(9):1134–40. https://doi.org/10.18805/LR-4427
- 14. Kumar P, Patel R, Reddy S. Performance of multiple linear regression and PCA in predicting the yield of legumes. J Agric Stat. 2017;65(3):152–8.
- 15. Sahu P, Kumar A, Patel R. Evaluation of multicollinearity in crop yield prediction: A comparison of PCA and MLR. Indian J Agric Sci. 2018;88(5):785–92. https://doi.org/10.56093/ijas.v88i5.81629
- 16. Reddy AK, Priya MS, Reddy DM, Reddy BR. Principal component analysis for yield in black gram (Vigna mungo L. Hepper) under organic and inorganic fertilizer managements. Int J Plant Soil Sci. 2021;33(9):26–34. https://doi.org/10.9734/ijpss/2021/v33i930463
- 17. Reni YP, Ramana MV, Rajesh AP, Madhavi GB, Prakash KK. Principal component analysis for yield and quality traits of black gram (Vigna mungo L. Hepper). Int J Plant Soil Sci. 2022;34(7):38–47. https://doi.org/10.9734/ijpss/2022/v34i730887
- 18. Sharma A, Singh B, Patel P. Application of principal component analysis in crop yield prediction. Int J Agric Sci. 2020;18(4):53–8. https://doi.org/10.15740/HAS/IJAS/18.4/53-58
- 19. Kumar P, Patel R, Reddy S. Performance of multiple linear regression and PCA in predicting the yield of legumes. J Agric Stat. 2017;65(3):152–8. https://doi.org/10.5958/0976-4666.2017.00021.1
- 20. Jain A, Kumar S, Singh B. Principal component analysis for crop yield prediction in pulses. Agric Sci Rev. 2021;12(1):44–50. https://doi.org/10.5958/0976-0571.2021.00007.9
- 21. Sahu P, Kumar A, Patel R. Evaluation of multicollinearity in crop yield prediction: A comparison of PCA and MLR. Indian J Agric Sci. 2018;88(5):785–92. https://doi.org/10.56093/ijas.v88i5.81629
- 22. Sruthi SR, Laleeth Kumar N, Kishore K, Johnny SI, Anbuselvam Y. Principal component analysis in rice (Oryza sativa L.) varieties for three seasons in Annamalai Nagar, an east coast region of Tamil Nadu. Plant Sci Today. 2024. https://doi.org/10.14719/pst.4105
Downloads
Download data is not yet available.