Working Women and Caste in India
Poster at ICLR 2019 AI for Social Good Workshop The story behind the project is interesting: Kuhu, my elder sister, had been reading about how economists have been using machine learning to do interesting things. In particular, she was fascinated by Bertrand and Kamenica’s work on how the US population’s cultural distance might be changing, and this tutorial/paper by Mullainathan and Spiess. During the summer holidays of 2018, she asked me to teach her the basics of machine learning and Python programming. We started brainstorming ideas while we were at it…
We came up with an approch similar to B and K, applying feature attribution to study the generational imact of caste on women’s labour force participation in India. We used Random Forest models on a large nationally-representative dataset (NFHS-4) to predict whether a woman was employed in white-collar/blue-collar jobs or unemployed. Next, we interpretted the trained models using the SHAP feature attribution framework.
Working remotely between New Delhi and Singapore, we submitted our work to the AI for Social Good Workshop at ICLR 2019 without too many expectations. Amazingly, we got accepted as a poster! We were officially co-authors now!
Kuhu travelled to New Orleans to present our poster and got a lot of useful feedback. Being junior researchers, we took ICLR (and my subsequent trip to NeurIPS) as an opportunity to speak to established researchers and students working in the intersection of social policy and machine learning. These conversations helped us understand the limitations of our work, leading us to make an addendum in the arXiv report with additional experiments on the robustness of our findings.