Text Mining in STEM Ed Research

The transition to digital learning has made available new sources of data, providing researchers new opportunities for understanding and improving STEM learning. Data sources such as digital learning environments and administrative data systems, as well as data produced by social media websites and the mass digitization of academic and practitioner publications, hold enormous potential to address a range of pressing problems in STEM Education, but collecting and analyzing text-based data also presents unique challenges. Text mining labs address the following critical questions:

What kinds of text data are valuable?
How can we quantify text data?
What are large language models? How we can use it for qualitative analysis?
What kinds of research questions could be addressed with text data?
How can we set up a research agenda that drives innovations in STEM education research with text data?

Lab 1: Text Mining Basics - Tidy Text & Word Counts as summarized in our Overview Presentation is a gentle introduction to getting our text “tidy” so we can perform some basic word counts, look at words that occur at a higher rate in a group of documents, examine words that are unique to those document groups, and create visualizations such as word cloud. The focus of our Essential Readings and case study in this lab is to help LASER Scholars gain a general understanding of key text mining concepts and terminology, as well as develop a basic comfort level with quantifying text data and working with text data. Our Text mining Case Study: What aspects of online professional development offerings do teachers find most valuable? is guided by the work from Friday Institute and it examined teachers' experiences in professional development. Finally, the Intro to Text Mining Badge provides an opportunity create your own data product and to reflect on how theses concepts and techniques might apply to your own research.

Lab 2: Dictionary-Methods - Twitter Sentiment and School Reform as summarized in our Overview Presentation moves beyond basic concepts of text mining and takes a closer look at a dictionary-based text mining technique, sentiment analysis. Our Essential Readings examine the topic of opinion mining or sentiment analysis. This technique is very helpful for us to understand people's opinions about things such as a policy. Our Text mining Case Study: Do the public like NGSS? investigates the public sentiment expressed toward the Next Generation Science Standards (NGSS) and compares the sentiment for NGSS and Common Core State Standards using twitter data. This study is from Josh's team (https://osf.io/xymsd/). Finally, the Sentiment Analysis Badge provides an opportunity create your own data product and to reflect on how theses concepts and techniques might apply to your own research.

Lab 3: large Language Models for Qualitative Analysis as summarized in our Overview Presentation examines recent advances in using large lanaguge models to classify text data. Through our essential readings, we'll learn about this technique. Our Text mining Case Study: How can we assess students' machine learning literacy automatically? is inspired by the need of assessing machine learning literacy and using automated assessment for real-time intervention in the field of AI education. Finally, the LLM Badge provides an opportunity create your own data product and to reflect on how theses concepts and techniques might apply to your own research.

Lab 4: Topic Modeling in MOOC-Eds as summarized in our Overview Presentation focuses on identifying “topics” by examining how words cohere into different latent, or hidden, themes based on patterns of co-occurrence of words within documents. Our Essential Readings introduces this unsupervised machine learning technique. Our Text mining Case Study: What are participants discussing in forums? is guided by the work from Friday Institute and it explores ideas or issues emerged in the discussion forums in a MOOC-ed course. You can learn more about the work here (https://www.learntechlib.org/p/195234/). Finally, the Topic modeling Badge provides an opportunity create your own data product and to reflect on how theses concepts and techniques might apply to your own research.

Name		Name	Last commit message	Last commit date
Latest commit History 289 Commits
dataset		dataset
image		image
lab-1		lab-1
lab-2		lab-2
lab-3		lab-3
lab-4		lab-4
tm-microcredential		tm-microcredential
tm-overview		tm-overview
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text Mining in STEM Ed Research

About

Releases

Packages

Languages

yangmohan08/text-mining

Folders and files

Latest commit

History

Repository files navigation

Text Mining in STEM Ed Research

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages