Template-Type: ReDIF-Article 1.0
Author-Name: Manuel César Vila
Author-Name: Xesús Pereira López
Author-Name: Rosa Maria Verdugo Matés
Title: Análisis comparativo de las fuentes estadísticas para la proyección de series temporales de migraciones regionales clasificadas por niveles educativos
Abstract: This paper analyses the statistical robustness of population censuses and the
Labour Force Survey (EPA), with the objective of estimating regional matrices
of migratory flows classified by education level, by means of
variants of the iterative proportional fitting procedure, based on the
information these sources contain.
Official migration statistics do not classify
migrants according to their qualifications, which makes it necessary to use
alternative sources in order to model population movements among autonomous
regions while taking migrants' education into account.
Using these alternative sources requires
a prior analysis of them, in order to select the one that
allows the best estimation of inter-regional migratory flows for
the different education levels of the migrant population.
We therefore analyse the robustness of the migratory
information from censuses and the Labour Force Survey, using different
indicators and procedures, in Section 2 of this paper. This is preceded by an
introduction, while Section 1 reviews the literature on skilled migration.
In Section 2, we first compare
both sources with the aggregate data from the Residential Variation
Statistics (EVR). This analysis is supplemented by the results of compensatory
equations for Spain as a whole and for each of its autonomous regions, based on
information from the last two population censuses, followed by the calculation of
mean relative errors (ERM) for the population totals and for people with
second- and third-level education in each of these regions.
The conclusion of these analyses is that the mobility
variable of population censuses is affected by non-response
and by the imputation method used by the National
Institute of Statistics (INE) to correct it, which explains why census
figures fall below those of the Residential Variation Statistics.
Drawing on
the evaluation reports on the quality of the 2001 Population Census and of the
Labour Force Survey for the years 2006 to 2017, we summarise the results of
several indicators, such as the percentage of identically classified cases, the net difference
rate, the net change index and the gross difference rate, and conclude that in all
cases the Labour Force Survey data are more robust than those
of the 2001 Census.
Finally,
using the Residential Variation Statistics of 2001, which include internal
migration data classified by academic qualification, we calculate
indicators of equality and association, such as Theil's U inequality statistic,
Pearson's coefficient of variation and the correlation coefficient, between this source
and both the census and the Labour Force Survey of that year.
As with the previous comparisons, these statistics show that the
Labour Force Survey data agree with the Residential Variation Statistics
better than the census data do.
After selecting the Labour Force Survey as the
statistical source for projecting regional migratory flows by
education level, in Section 3 of this paper we use iterative proportional
fitting (IPF) to obtain the migration matrices.
The procedure for estimating migratory flows
from the Labour Force Survey micro-data is carried out
in several phases. It begins with the matrix generated annually from the four
quarterly micro-data files, filtering the population aged 16 and over
that changed residence in the previous year and aggregating the information into a matrix
with 20×5 rows and 19 columns, which shows the structure of inter-regional
migratory flows and of immigrants from abroad classified into five education
levels. The marginal totals of this micro-data matrix are then grossed up to obtain
new ones that converge to the aggregate values of the Labour Force
Survey module “Subsample variables: persons that have changed residence a year
ago,” using a bi-proportional adjustment procedure. In this way, the NUTS-2 migrations
classified by education level provided by the micro-data
sample are grossed up to the values of the NUTS-1 migration movements supplied
by the “Subsample variables.”
Before this estimation, the correlation between the
aggregate migratory flows of the Labour Force Survey and those of the Residential
Variation Statistics is tested for all the years analysed. To that end, the
Residential Variation Statistics data are aggregated to NUTS-1 from the
original NUTS-2 data. The correlation between both sources is very high,
with values exceeding 95 per cent in most cases and never lower than 82 per
cent. Each generic element of the micro-data matrix is denoted by m_{ij/k}(t),
where i is the region of origin (17 autonomous regions, 2 autonomous
cities and abroad), j the
destination region (17 autonomous regions and 2 autonomous cities), k the education level (illiterate,
without primary education, primary education, secondary education and higher
education) and t the year of
the survey (from 2000 to 2017). These matrices reflect an inter-regional
migratory structure, but the absolute figures are limited to the interviewed sample,
which requires a grossing-up adjustment using the data of the “Labour Force
Survey subsample: Persons that have changed residence a year ago. Persons aged
16 and over that have changed residence a year ago by education level
attained and place of origin/destination,” whose results are grouped into 8
NUTS-1 regions for the places of origin and 7 for those of destination. This
adjustment enables us to calculate the margins r_{i/k}(t)
and s_{j/k}(t),
which make it possible to start the iterative proportional fitting
procedure and obtain the 18 annual matrices of the series (2000-2017).
After adjusting the margins of the micro-data
matrix to the absolute levels of the Labour Force Survey subsample, the iterative
proportional fitting procedure can be used to calculate the interior elements
of this matrix, making use of those margins. The proposed methodology
attains convergence in all eighteen estimated annual matrices.
The results obtained give a broader view of
the relations among regions with regard to this type of migration, and enable
subsequent univariate or multivariate analyses. They could also be examined
using input-output methodology or even network theory, as
proposed in the last section of this paper along with the conclusions.
Classification-JEL: R1
Keywords: Censo, Migraciones interiores, Iterative Proportional Fitting, Nivel Educativo, Census, Internal Migration, Education Level
Pages: 17-56
Volume: 1
Year: 2021
File-URL: http://www.revistaestudiosregionales.com/documentos/articulos/pdf-articulo-2605.pdf
File-Format: Application/pdf
Handle: RePEc:rer:articu:v:1:y:2021:p:17-56