Singular Value Decomposition and factor analysis in Social Science and Humanities
DOI:
https://doi.org/10.46661/rev.metodoscuant.econ.empresa.8004Keywords:
Singular Value Decomposition, Factor Analysis, Humanities and Social SciencesAbstract
The objects of study of the humanities and social sciences are intrinsically complex. Because it is philosophically attractive, and because it helps in practice to manage
such complexity, one of the most influential central ideas throughout the history and present of these disciplines is the notion that the large number of empirical manifestations that characterize their objects of study are actually expressions of a few factors that influence all other variables. The corresponding statistical methodology to implement these ideas has different names and differs in detail in different disciplines, but one name that can be recognized in many of them is “factor analysis”. The first objective of this work is to present a classical method of
linear algebra, known as “Singular Value Decomposition” (SVD), in an intuitive and at the same time rigorous way to the community of human and social sciences.
SVD systematizes and generalizes the factorization of any data matrix. In addition, the method is of enormous importance in the era of big data and machine learning,
which are increasingly influencing research in all areas of study. The second objective is to invite questioning of certain hypotheses in traditional factor analysis. The SVD
reveals that factors are inherent in any matrix-structured data set; what is crucial is how singular values decay. Data will determine this decay, with potentially profoundly
transformative theoretical repercussions.
Downloads
References
Alter, O., Brown, P. O., & Botstein, D. (2000). Singular value decomposition for genome-wide expression data processing and modeling. Proceedings of the National Academy of Sciences of the United States of America, 97(18), 10101-10106.
Athey, S., Bayati, M., Doudchenko, N., Imbens, G., & Khosravi, K. (2021). Matrix completion methods for causal panel data models. Journal of the American Statistical Association, 116(536), 1716-1730.https://doi.org/10.1080/01621459.2021.1891924
Athey, S. (2019). 21. The Impact of Machine Learning on Economics. In A. Agrawal, J. Gans & A. Goldfarb (Ed.), The Economics of Artificial Intelligence: An Agenda (pp. 507-552). Chicago: University of Chicago Press.https://doi.org/10.7208/chicago/9780226613475.003.0021
Athey, S., & Imbens, G. W. (2019). Machine learning methods that economists should know about. Annual Review of Economics, 11, 685-725.https://doi.org/10.1146/annurev-economics-080217-053433
Bai, J., & Ng, S. (2002). Determining the number of factors in approximate factor models. Econometrica, 70(1), 191-221.https://doi.org/10.1111/1468-0262.00273
Bai, J., & Ng, S. (2008). Large dimensional factor analysis. Foundations and Trends® in Econometrics, 3(2), 89-163.
https://doi.org/10.1561/0800000002
Bolukbasi, T., Chang, K. W., Zou, J. Y., Saligrama, V., & Kalai, A. T. (2016). Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. Advances in Neural Information Processing Systems, 29: https://scholar.google.com/scholar_lookup?arxiv_id=1607.06520
Burt, C. (1909). Experimental tests of general intelligence. British Journal of Psychology, 3(1), 94.https://doi.org/10.1111/j.2044-8295.1909.tb00197.x
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., & Harshman, R. (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6), 391-407.https://doi.org/10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9
Eckart, C., & Young, G. (1936). The approximation of one matrix by another of lower rank. Psychometrika, 1(3), 211-218.https://doi.org/10.1007/BF02288367
Maxwell-Garnett, J. C. (1919). On certain independent factors in mental measurements. Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character, 96(675), 91-111.https://doi.org/10.1098/rspa.1919.0041
Harman, H. H. (1976). Modern factor analysis. University of Chicago press.
Holter, N. S., Mitra, M., Maritan, A., Cieplak, M., Banavar, J. R., & Fedoroff, N. V. (2000). Fundamental patterns underlying gene expression profiles: simplicity from complexity. Proceedings of the National Academy of Sciences, 97(15), 8409-8414.
https://doi.org/10.1073/pnas.150242097
Holzinger, K. J. (1930). Statistical résumé of the Spearman two-factor theory. (Mimeographed).
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: a review and recent developments. Philosophical transactions of the royal society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202.https://doi.org/10.1098/rsta.2015.0202
Lay, D. C. (2007). Álgebra Lineal y sus Aplicaciones. Pearson educación.
Liberty, E., Woolfe, F., Martinsson, P. G., Rokhlin, V., & Tygert, M. (2007). Randomized algorithms for the low-rank approximation of matrices. Proceedings of the National Academy of Sciences, 104(51), 20167-20172.https://doi.org/10.1073/pnas.0709640104
Lintner, J. (1975). The valuation of risk assets and the selection of risky investments in stock portfolios and capital budgets. In Stochastic optimization models in finance (pp. 131-155). Academic Press.
Mossin, J. (1966). Equilibrium in a capital asset market. Econometrica: Journal of the Econometric Society, 768-783.https://doi.org/10.2307/1910098
Muller, N., Magaia, L., & Herbst, B. M. (2004). Singular value decomposition, eigenfaces, and 3D reconstructions. SIAM Review, 46(3), 518-545.https://doi.org/10.1137/S0036144501387517
Novembre, J., Johnson, T., Bryc, K., Kutalik, Z., Boyko, A. R., Auton, A., ... & Bustamante, C. D. (2008). Genes mirror geography within Europe. Nature, 456(7218), 98-101.https://doi.org/10.1038/nature07331
Pearson, K. (1901). LIII. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 2(11), 559-572.https://doi.org/10.1080/14786440109462720
Ross, S. A. (2013). The arbitrage theory of capital asset pricing. In Handbook of the fundamentals of financial decision making: Part I (pp. 11-30).https://doi.org/10.1142/9789814417358_0001
Sharpe, W. F. (1964). Capital asset prices: A theory of market equilibrium under conditions of risk. The Journal of Finance, 19(3), 425-442.
https://doi.org/10.1111/j.1540-6261.1964.tb02865.x
Spearman, C. (1904). General Intelligence, Objectively Determined and Measured. The American Journal of Psychology, 15(2), 201-292. https://doi.org/10.2307/1412107.
Spearman, C. (1927). The Abilities of Man: Their Nature and Measurement. Journal of Philosophical Studies, 2(8), 557-560.https://doi.org/10.1017/S0031819100015825
Stewart, G. (1993): On the Early History of the Singular Value Decomposition". SIAM Review, 35( 4), 551-566.https://doi.org/10.1137/1035134
Stock, J. & Watson, M. (2011). Dynamic factor models. Oxford Handbooks Online.https://doi.org/10.1093/oxfordhb/9780195398649.013.0003
Stock, J. & Watson, M. (2015). Factor Models for Macroeconomics. En Taylor, J. B. y Uhlig, H. (Eds.), Handbook of Macroeconomics (Vol. 2). North Holland.
Thomson, G. (1938). Methods of Estimating Mental Factors. Nature, 141, 246.https://doi.org/10.1038/141246a0
Thurstone, L. (1931). Multiple factor analysis. Psychological Review, 38(5), p. 406.https://doi.org/10.1037/h0069792
Thurstone, L. (1947). Multiple-factor analysis; a development and expansion of The Vectors of Mind. University of Chicago Press.
Turk, M., & Pentland, A. (1991a). Eigenfaces for recognition. Journal of cognitive neuroscience, 3(1), 71-86. https://scholar.google.com.ar/scholar?q=Eigenfaces+for+recognition.+Journal+of+Cognitive+Neuroscience,+(3).&hl=es&as_sdt=0&as_vis=1&oi=scholarthttps://doi.org/10.1162/jocn.1991.3.1.71
Turk, M. & Pentland, A. (1991b). Face recognition using Eigenfaces. Proc. of Computer Vision and Pattern Recognition, (3), 586-591.
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Sergio A. Pernice

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Submission of manuscripts implies that the work described has not been published before (except in the form of an abstract or as part of thesis), that it is not under consideration for publication elsewhere and that, in case of acceptance, the authors agree to automatic transfer of the copyright to the Journal for its publication and dissemination. Authors retain the authors' right to use and share the article according to a personal or instutional use or scholarly sharing purposes; in addition, they retain patent, trademark and other intellectual property rights (including research data).
All the articles are published in the Journal under the Creative Commons license CC-BY-SA (Attribution-ShareAlike). It is allowed a commercial use of the work (always including the author attribution) and other derivative works, which must be released under the same license as the original work.
Up to Volume 21, this Journal has been licensing the articles under the Creative Commons license CC-BY-SA 3.0 ES. Starting from Volume 22, the Creative Commons license CC-BY-SA 4.0 is used.