In the summer of 2025, I had the privilege of participating in the China Rural Development Survey (CRDS) project, led by Professor Linxiu Zhang and under the guidance of Dr. Yunli Bai.
This longitudinal research project, initiated in 2005, has been continuously tracking development trends in rural China and has accumulated nearly two decades of valuable data. My primary responsibility involved systematically compiling and coding village-level questionnaires from 2005 to 2023 to support subsequent research on rural development.
At the start of the internship, Dr. Yunli Bai gave me a comprehensive introduction to the overall framework of the CRDS project and emphasized the importance of standardized data coding. My work was divided into three stages:
The first stage involved systematic data compilation. I reviewed all waves of village-level questionnaires dating back to 2005 and categorized the survey questions by year. Because the survey content had been adjusted in response to evolving policies and socioeconomic developments, question design and phrasing varied across years. To address this, I created a cross-year question comparison table that grouped similar or related questions.
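To illustrate what a cross-year question comparison table can look like in practice, here is a minimal Python sketch. It is not the actual CRDS table: the wave years, item labels, question wording, and concept names are all hypothetical, and the helper functions `build_crosswalk` and `missing_years` are names I made up for this example.

```python
from collections import defaultdict

# Hypothetical questionnaire items; NOT taken from the real CRDS instruments.
# Each row records where one concept appears in one survey wave.
questions = [
    {"year": 2005, "item": "A1", "text": "Total village population", "concept": "population"},
    {"year": 2012, "item": "B3", "text": "Number of permanent residents", "concept": "population"},
    {"year": 2023, "item": "B2", "text": "Number of permanent residents", "concept": "population"},
    {"year": 2005, "item": "C4", "text": "Distance to nearest paved road (km)", "concept": "road_access"},
    {"year": 2023, "item": "D1", "text": "Is the village connected by a paved road?", "concept": "road_access"},
]


def build_crosswalk(rows):
    """Group items by concept, then by year: {concept: {year: item_label}}."""
    table = defaultdict(dict)
    for row in rows:
        table[row["concept"]][row["year"]] = row["item"]
    return dict(table)


def missing_years(crosswalk, years):
    """For each concept, list the waves in which no matching item was found."""
    return {
        concept: [y for y in years if y not in by_year]
        for concept, by_year in crosswalk.items()
    }


crosswalk = build_crosswalk(questions)
gaps = missing_years(crosswalk, [2005, 2012, 2023])
```

A table of this shape makes it easy to see at a glance which concepts are covered in every wave and which were added, dropped, or reworded along the way.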
The second stage involved developing a coding framework based on the organized questionnaire items. This process presented several significant challenges: the same research question might be phrased differently across survey years, the measurement methods for certain indicators evolved over time, and some questions from earlier questionnaires were either discontinued or substantially modified in subsequent survey waves.
The third stage focused on code implementation and validation. After finalizing the coding scheme, I proceeded with the actual data coding. To ensure quality and consistency, I developed a coding system that documented each code’s definition and applicable scope in detail, repeatedly verified key variables to minimize errors, and held regular discussions with Dr. Bai to resolve ambiguities and refine coding decisions.
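The verification step can be pictured with a small Python sketch that checks coded values against documented definitions. This is only an illustration under stated assumptions: the variable names, code values, the `-9` missing-value convention, and the `find_invalid` helper are hypothetical and do not describe the project's actual coding scheme or tooling.

```python
# Hypothetical codebook: each variable has a definition and a set of valid codes.
# The variable names and the -9 "missing" convention are assumptions for this sketch.
codebook = {
    "road_access": {
        "definition": "Village connected by a paved road (1 = yes, 0 = no, -9 = missing)",
        "valid_codes": {0, 1, -9},
    },
    "has_clinic": {
        "definition": "Village has a clinic (1 = yes, 0 = no, -9 = missing)",
        "valid_codes": {0, 1, -9},
    },
}

# Hypothetical coded records, one per village-year.
records = [
    {"village_id": "V001", "road_access": 1, "has_clinic": 0},
    {"village_id": "V002", "road_access": 2, "has_clinic": 1},   # 2 is not a documented code
    {"village_id": "V003", "road_access": -9, "irrigated": 1},   # variable absent from codebook
]


def find_invalid(rows, book, id_field="village_id"):
    """Return (record id, variable, reason) for every value the codebook does not allow."""
    problems = []
    for row in rows:
        for var, value in row.items():
            if var == id_field:
                continue
            spec = book.get(var)
            if spec is None:
                problems.append((row[id_field], var, "variable not in codebook"))
            elif value not in spec["valid_codes"]:
                problems.append((row[id_field], var, f"invalid code {value!r}"))
    return problems


issues = find_invalid(records, codebook)
```

Each flagged entry would then be a candidate for rechecking against the original questionnaire or for discussion before the code is finalized.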
This internship has significantly enhanced my abilities in several respects. Professionally, I acquired skills in processing longitudinal data and learned to design systematic coding schemes. In terms of research capabilities, the experience cultivated a rigorous academic mindset, sharpened my ability to identify and solve problems, and strengthened my ability to communicate research effectively. On a personal level, the internship deepened my appreciation for the precision and systematic nature of academic work: every coding decision could affect the final research outcomes, which demands meticulous attention. I also gained a deeper understanding of the importance of teamwork in scientific research.
Finally, I would like to express my deepest gratitude to Dr. Yunli Bai for her dedicated guidance throughout this internship. Dr. Bai’s rigorous academic attitude and patient mentorship have been invaluable to my growth. This opportunity has not only allowed me to accumulate research experience but also deepened my understanding of my field of study.
Moving forward, I aspire to uphold the same spirit of exploration and truth-seeking pragmatism, contributing my efforts to China’s development. This internship will remain a pivotal milestone in my academic journey, and I will always cherish what I learned from it.
Author: Jingyi Li, Undergraduate Student in Sociology, Nanjing Agricultural University.