Logo

Data Science Colloquium FY21 - Shared screen with speaker view
micah saxton
48:36
https://colab.research.google.com/drive/1m9er740u-D9Gw2ZeWfBfa2N7iu3CQToL?usp=sharing
Raghav Sharma
55:01
would it be okay to save this Colab to GitHub so people without G-Suite can access it?
micah saxton
55:16
Yes Raghav, that's fine
Raghav Sharma
55:24
thank you
Micah Bailey Arafah
01:05:05
medical texts to count mentions of gender, categorized by illness/disease/condition?
Timothy Beal
01:05:27
fantastic idea Micah
Cathie Kelsey
01:09:48
Could this method show different language used by one author for different audiences or in different periods?
Micah Bailey Arafah
01:09:59
Nice Cathie
Raghav Sharma
01:10:04
ooh!
Cathie Kelsey
01:10:33
Exactly
Brian Clites
01:11:20
This is definitely exciting. Your example focuses on books. But you could create your own spreadsheets out of archival news stories, letters, etc, right? (i.e. books are common, but any text will work)
Timothy Beal
01:11:35
Interesting how quickly the research ideas/questions move to design questions …
Brian Clites
01:12:02
Awesome. Thanks!
David Busch
01:12:35
Yes, corpora design is where the most work is!
Micah Bailey Arafah
01:19:58
Am I thinking of this correctly in that this can help analyze texts to determine trends (ie LGBTQ+ terms, white supremacy terms)?
micah saxton
01:21:58
Yes, Micah. That would be a great project!
Micah Bailey Arafah
01:23:38
thank you Micah
micah saxton
01:27:24
Note: your cell number may not be the same
micah saxton
01:30:36
I heart Asimov
Brian Clites
01:30:37
isaac
Timothy Beal
01:31:30
Note that the 0th index number is the 1st in Python
Brian Clites
01:33:08
men. obits.
Brian Clites
01:34:06
i see that all the results are lowercase. Can you exclude proper nouns from Tfidf?
Justin Barber
01:35:33
You could exclude them in a separate step, yes. You may need to use named entity recognition (to identify names).
Cathie Kelsey
01:41:22
It looks as though beginners most easily start with questions of “what can I do with this?
Brian Gran
01:41:33
Thank you. I have another meeting now. I will watch the recording. Great seminar and terrific opportunity.
Michael Hemenway
01:41:34
yes!
Michael Hemenway
01:41:43
thanks Brian
Cathie Kelsey
01:41:43
And advanced folks start with what they want to figure out?
Michael Hemenway
01:42:30
Cathie, yes. first, i think we have to see some possibilities and then to imagine how it might relate to a research project i am already working on.
Timothy Beal
01:42:33
Good point Cathie!
Michael Hemenway
01:42:42
then as we work with the tools more, new questions emerge.
Timothy Beal
01:43:37
yes. iterative emergence
Michael Hemenway
01:43:56
yes!
Micah Bailey Arafah
01:49:39
apostrophe
Brian Clites
01:58:14
what are the empty parentheses for after title in the example?
Micah Bailey Arafah
01:58:41
no quotes
Michael Hemenway
01:58:46
great question Brian. the parens after title indicate that that is a method
Michael Hemenway
01:59:00
so it is an action that is taken on the string.
Michael Hemenway
01:59:46
.title() - theoretically, we could pass arguments inside the parentheses
Michael Hemenway
02:00:25
alternatively, strings can have properties that are not actions, those would come after the dot notation without the empty parens
Raghav Sharma
02:04:53
would removing index space 2 look like nums.remove(nums[2]) ?
Michael Hemenway
02:06:44
yep
Michael Hemenway
02:07:14
remove method removes first element that matches what is passed.
Michael Hemenway
02:07:47
so, i believe, if your list had same value also at index 0, actually it would remove the index 0 item, not item 2.
Michael Hemenway
02:08:24
.pop() method will remove by index.
Fred Belcavello
02:09:18
Thanks for the great workshop, but, unfortunately, I have to leave!
Michael Hemenway
02:09:37
ciao Fred!
Mark Turner
02:12:03
Thank you, Micah and Michael, for a superb introduction.
William Deal
02:12:21
Thanks Micah and Michael! Very helpful presentation!
Michael Hemenway
02:12:22
Thanks Mark!
Michael Hemenway
02:12:32
Thanks Bill!
David Busch
02:12:48
Thanks Micah and Michael! Very informative!
Mark Turner
02:12:56
The collar link is already up.
Michael Hemenway
02:12:58
Thanks David!
Brian Clites
02:13:08
Yes, Thank you Michael & Micah!
Raghav Sharma
02:13:10
thanks Micah and Michael!
Micah Bailey Arafah
02:13:12
I was so intimidated- thank you for making it so accessible
Michael Hemenway
02:13:34
Thank you Brian, Raghav, and Micah!
Cathie Kelsey
02:13:38
Thank you, Micah and Michael. Inspiring! ;-)
Kailey Zaronias
02:13:40
Thank you very much! This was very helpful and I look forward to learning more.
Michael Hemenway
02:13:47
I will add our contact info to the colab notebook.
Michael Hemenway
02:13:55
thanks Cathie and Kailey