Copyedit interactive data visualization dashboard #642

charlottejmc · 2024-11-19T09:42:29Z

I've now prepared my copyedits for en/interactive-data-visualization-dashboard, represented in Issue #609.

Copyedit to 492

Copyedit to end

Copyedits round 2

charlottejmc · 2024-11-19T09:44:50Z

en/drafts/originals/interactive-data-visualization-dashboard.md


-Quantitative methods for content analysis (CA) have long been a tradition in mass communication studies, and the method of algorithmic text analysis (ATA) has become popular given the rising availability of large amounts of textual data.[^2] Both methods aim to infer meanings from text through classification or measurement. Whereas CA relies heavily on a carefully crafted codebook based on research questions and multiple human coders to ensure the reliability and validity of a systematic analysis,[^3] [^4] ATA relies on algorithms and models (a more general term for this method is [text mining](https://en.wikipedia.org/wiki/Text_mining) or [natural language processing](https://en.wikipedia.org/wiki/Natural_language_processing)).[^5] 
+The case study investigates how U.S. television stations have covered the current [war between Russia and Ukraine](https://en.wikipedia.org/wiki/Russo-Ukrainian_War). For example, we could compare the frequency of Ukraine-related keywords to Russia-related keywords employed by different stations. We could also compare the amount of coverage of the war between certain stations. 


I've added the wikipedia link https://en.wikipedia.org/wiki/Russo-Ukrainian_War to give context to our readers, but would you prefer the following article instead? https://en.wikipedia.org/wiki/Russian_invasion_of_Ukraine

charlottejmc · 2024-11-19T09:46:21Z

en/drafts/originals/interactive-data-visualization-dashboard.md


-The approach used in the case study situates somewhere in between CA and ATA. On the one hand, this approach only conducts a distant reading, relying less on human coders often required in CA. On the other hand, this approach only measures the manifest features of text (i.e., frequency) and does not involve the types of algorithmic classification that is often seen in ATA. This approach of distant reading aims to discover patterns from large amount of data.[^6] 
+Mass communication studies have traditionally relied on quantitative Content Analysis (CA) methods. However, algorithmic text analysis (ATA) has also recently grown in popularity, due to the rising availability of large amounts of textual data.[^2] Both types of methods aim to infer meanings from text through classification or measurement. Whereas CA relies heavily on a carefully crafted codebook, built around research questions and verified by multiple human coders,[^3] [^4] ATA relies on computational methods like statistics and machine learning. You might have heard for example of [text mining](https://en.wikipedia.org/wiki/Text_mining), or [Natural Language Processing](https://en.wikipedia.org/wiki/Natural_language_processing).[^5] 


I reworded this paragraph quite a bit, so I wanted to check with you that it still expresses your original meaning. I was a little confused by the first sentence especially. I also didn't think 'text mining' or 'natural language processing' could really be called 'more general terms' for ATA – the latter sounds like a much more general term, of which the former two would be specific instantiations! What do you think?

charlottejmc · 2024-11-19T09:56:47Z

en/drafts/originals/interactive-data-visualization-dashboard.md


-Second, scroll down and find the section called "Environment Variables." Click "Add Environment Variable" and input `PYTHON_VERSION` as the key and the Python version that you use on your machine as the value (use `$python -V` in command line to check your Python version). If you don't explicitly specify the Python version this way, Render will use Python 3.7 as default, which may cause conflicts between this old Python version and the libraries' versions specified in `requirements.txt`.
+To demonstrate the wide applicability of the approach used in the case study above, this lesson will show you how to create another dashboard, using a different dataset. This second example explores a different research question: 'What were the top 10 languages used by non-English U.S. newspapers by decade from the 1690s to today?' We'll design a dashboard to show the top ten languages in each decade dating back to the 1690s, highlighting any shifts in their rankings, and the emergence or decline of different languages over time. 


I tried changing the wording of the second research question here, but I still think it is very tricky to try to express your particular question in a clear way. Do you prefer the original? Maybe we can brainstorm some options...

Minor edit

charlottejmc · 2024-11-19T10:00:22Z

Hello @hluling and @caiocmello,

I've now prepared the copyedits for this lesson. I'd be grateful if you could review the adjustments and confirm that you are happy for me to merge these. You can see the details of my edits under the files changed tab!

I'd like to bring your attention to the comments I have attached to specific lines. You can respond to any of my suggestions via the comments below, or click Resolve conversation if you're happy with them like this:

If you want to make edits, please work here by clicking the three dots at the upper right of the file, then Edit file:

charlottejmc · 2024-11-19T10:03:49Z

One other thing, @hluling: your lesson will need a very short abstract (1-2 sentences max) to give a rapid overview of what it teaches. Could you prepare this and either send it to me via a comment, or add it directly in this branch to the abstract field in the markdown file's YAML header? Thank you!

charlottejmc added 3 commits November 13, 2024 16:23

Update interactive-data-visualization-dashboard.md

bc461a7

Copyedit to 492

Update interactive-data-visualization-dashboard.md

2613896

Copyedit to end

Update interactive-data-visualization-dashboard.md

74d095f

Copyedits round 2

charlottejmc added English 6. Sustainability + Accessibility Original labels Nov 19, 2024

charlottejmc self-assigned this Nov 19, 2024

charlottejmc commented Nov 19, 2024

View reviewed changes

Update interactive-data-visualization-dashboard.md

f2fb4d1

Minor edit

charlottejmc mentioned this pull request Nov 19, 2024

Creating a Dashboard for Interactive Data Visualization with Dash in Python #609

Open

anisa-hawes requested a review from hluling November 20, 2024 13:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Copyedit interactive data visualization dashboard #642

Copyedit interactive data visualization dashboard #642

charlottejmc commented Nov 19, 2024 •

edited

Loading

charlottejmc Nov 19, 2024 •

edited

Loading

charlottejmc Nov 19, 2024 •

edited

Loading

charlottejmc Nov 19, 2024

charlottejmc commented Nov 19, 2024

charlottejmc commented Nov 19, 2024 •

edited by anisa-hawes

Loading


		Quantitative methods for content analysis (CA) have long been a tradition in mass communication studies, and the method of algorithmic text analysis (ATA) has become popular given the rising availability of large amounts of textual data.[^2] Both methods aim to infer meanings from text through classification or measurement. Whereas CA relies heavily on a carefully crafted codebook based on research questions and multiple human coders to ensure the reliability and validity of a systematic analysis,[^3] [^4] ATA relies on algorithms and models (a more general term for this method is [text mining](https://en.wikipedia.org/wiki/Text_mining) or [natural language processing](https://en.wikipedia.org/wiki/Natural_language_processing)).[^5]
		The case study investigates how U.S. television stations have covered the current [war between Russia and Ukraine](https://en.wikipedia.org/wiki/Russo-Ukrainian_War). For example, we could compare the frequency of Ukraine-related keywords to Russia-related keywords employed by different stations. We could also compare the amount of coverage of the war between certain stations.


		The approach used in the case study situates somewhere in between CA and ATA. On the one hand, this approach only conducts a distant reading, relying less on human coders often required in CA. On the other hand, this approach only measures the manifest features of text (i.e., frequency) and does not involve the types of algorithmic classification that is often seen in ATA. This approach of distant reading aims to discover patterns from large amount of data.[^6]
		Mass communication studies have traditionally relied on quantitative Content Analysis (CA) methods. However, algorithmic text analysis (ATA) has also recently grown in popularity, due to the rising availability of large amounts of textual data.[^2] Both types of methods aim to infer meanings from text through classification or measurement. Whereas CA relies heavily on a carefully crafted codebook, built around research questions and verified by multiple human coders,[^3] [^4] ATA relies on computational methods like statistics and machine learning. You might have heard for example of [text mining](https://en.wikipedia.org/wiki/Text_mining), or [Natural Language Processing](https://en.wikipedia.org/wiki/Natural_language_processing).[^5]


		Second, scroll down and find the section called "Environment Variables." Click "Add Environment Variable" and input `PYTHON_VERSION` as the key and the Python version that you use on your machine as the value (use `$python -V` in command line to check your Python version). If you don't explicitly specify the Python version this way, Render will use Python 3.7 as default, which may cause conflicts between this old Python version and the libraries' versions specified in `requirements.txt`.
		To demonstrate the wide applicability of the approach used in the case study above, this lesson will show you how to create another dashboard, using a different dataset. This second example explores a different research question: 'What were the top 10 languages used by non-English U.S. newspapers by decade from the 1690s to today?' We'll design a dashboard to show the top ten languages in each decade dating back to the 1690s, highlighting any shifts in their rankings, and the emergence or decline of different languages over time.

Copyedit interactive data visualization dashboard #642

Are you sure you want to change the base?

Copyedit interactive data visualization dashboard #642

Conversation

charlottejmc commented Nov 19, 2024 • edited Loading

charlottejmc Nov 19, 2024 • edited Loading

Choose a reason for hiding this comment

charlottejmc Nov 19, 2024 • edited Loading

Choose a reason for hiding this comment

charlottejmc Nov 19, 2024

Choose a reason for hiding this comment

charlottejmc commented Nov 19, 2024

charlottejmc commented Nov 19, 2024 • edited by anisa-hawes Loading

charlottejmc commented Nov 19, 2024 •

edited

Loading

charlottejmc Nov 19, 2024 •

edited

Loading

charlottejmc Nov 19, 2024 •

edited

Loading

charlottejmc commented Nov 19, 2024 •

edited by anisa-hawes

Loading