Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Copyedit interactive data visualization dashboard #642

Open
wants to merge 4 commits into
base: gh-pages
Choose a base branch
from

Conversation

charlottejmc
Copy link
Collaborator

@charlottejmc charlottejmc commented Nov 19, 2024

I've now prepared my copyedits for en/interactive-data-visualization-dashboard, represented in Issue #609.


Quantitative methods for content analysis (CA) have long been a tradition in mass communication studies, and the method of algorithmic text analysis (ATA) has become popular given the rising availability of large amounts of textual data.[^2] Both methods aim to infer meanings from text through classification or measurement. Whereas CA relies heavily on a carefully crafted codebook based on research questions and multiple human coders to ensure the reliability and validity of a systematic analysis,[^3] [^4] ATA relies on algorithms and models (a more general term for this method is [text mining](https://en.wikipedia.org/wiki/Text_mining) or [natural language processing](https://en.wikipedia.org/wiki/Natural_language_processing)).[^5]
The case study investigates how U.S. television stations have covered the current [war between Russia and Ukraine](https://en.wikipedia.org/wiki/Russo-Ukrainian_War). For example, we could compare the frequency of Ukraine-related keywords to Russia-related keywords employed by different stations. We could also compare the amount of coverage of the war between certain stations.
Copy link
Collaborator Author

@charlottejmc charlottejmc Nov 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've added the wikipedia link https://en.wikipedia.org/wiki/Russo-Ukrainian_War to give context to our readers, but would you prefer the following article instead? https://en.wikipedia.org/wiki/Russian_invasion_of_Ukraine


The approach used in the case study situates somewhere in between CA and ATA. On the one hand, this approach only conducts a distant reading, relying less on human coders often required in CA. On the other hand, this approach only measures the manifest features of text (i.e., frequency) and does not involve the types of algorithmic classification that is often seen in ATA. This approach of distant reading aims to discover patterns from large amount of data.[^6]
Mass communication studies have traditionally relied on quantitative Content Analysis (CA) methods. However, algorithmic text analysis (ATA) has also recently grown in popularity, due to the rising availability of large amounts of textual data.[^2] Both types of methods aim to infer meanings from text through classification or measurement. Whereas CA relies heavily on a carefully crafted codebook, built around research questions and verified by multiple human coders,[^3] [^4] ATA relies on computational methods like statistics and machine learning. You might have heard for example of [text mining](https://en.wikipedia.org/wiki/Text_mining), or [Natural Language Processing](https://en.wikipedia.org/wiki/Natural_language_processing).[^5]
Copy link
Collaborator Author

@charlottejmc charlottejmc Nov 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I reworded this paragraph quite a bit, so I wanted to check with you that it still expresses your original meaning. I was a little confused by the first sentence especially. I also didn't think 'text mining' or 'natural language processing' could really be called 'more general terms' for ATA – the latter sounds like a much more general term, of which the former two would be specific instantiations! What do you think?


Second, scroll down and find the section called "Environment Variables." Click "Add Environment Variable" and input `PYTHON_VERSION` as the key and the Python version that you use on your machine as the value (use `$python -V` in command line to check your Python version). If you don't explicitly specify the Python version this way, Render will use Python 3.7 as default, which may cause conflicts between this old Python version and the libraries' versions specified in `requirements.txt`.
To demonstrate the wide applicability of the approach used in the case study above, this lesson will show you how to create another dashboard, using a different dataset. This second example explores a different research question: 'What were the top 10 languages used by non-English U.S. newspapers by decade from the 1690s to today?' We'll design a dashboard to show the top ten languages in each decade dating back to the 1690s, highlighting any shifts in their rankings, and the emergence or decline of different languages over time.
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tried changing the wording of the second research question here, but I still think it is very tricky to try to express your particular question in a clear way. Do you prefer the original? Maybe we can brainstorm some options...

@charlottejmc
Copy link
Collaborator Author

Hello @hluling and @caiocmello,

I've now prepared the copyedits for this lesson. I'd be grateful if you could review the adjustments and confirm that you are happy for me to merge these. You can see the details of my edits under the files changed tab!

I'd like to bring your attention to the comments I have attached to specific lines. You can respond to any of my suggestions via the comments below, or click Resolve conversation if you're happy with them like this:

image

If you want to make edits, please work here by clicking the three dots at the upper right of the file, then Edit file:

image

@charlottejmc
Copy link
Collaborator Author

charlottejmc commented Nov 19, 2024

One other thing, @hluling: your lesson will need a very short abstract (1-2 sentences max) to give a rapid overview of what it teaches. Could you prepare this and either send it to me via a comment, or add it directly in this branch to the abstract field in the markdown file's YAML header? Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant