In data management, the problem of interoperability (#interoperabilità) arises at several levels. This month on ShareTIGR we will reflect on the format of transcriptions produced manually by listening to audio- and video-recorded conversations.
Morphology of transcriptions, part I: readable in what way? https://sharetigr.usi.ch/it/news/feeds/38046
A transcription (#trascrizione) produced with a multimedia annotator, such as ELAN, which we used in the InfinIta project, contains computer code that requires specialized software to be displayed and interpreted correctly. When sharing your transcriptions with other scholars, it is therefore worth asking: Which applications will future users rely on? Will those applications be able to read the documents created by our transcription program? #FAIRdata #opendata #openresearchdata #spkenlanguage #transcription #italianoparlato #corpuslinguistics @dh @linguistics
Totally busy finalising both my thesis and a future book, but still delighted to receive an invitation for a #CorpusLinguistics book review in a very relevant journal. Will post when published, promise 😃 #academicchatter @phdlife @corpuslinguistics
I kinda gave up on the idea of teaching students command line stuff in the intro to corpus linguistics 😢, but now I think I can't actually expect any tools other than browser based ones, because many students only have tablets? 😭 dang #corpusLinguistics #teaching
@tschfflr This was one thing that helped me a lot teaching at Saint John's: every full time student was given a laptop on matriculation, paid for by tuition, and financial aid if they qualified. The laptops were maintained by the university IT department until graduation.
That allowed me to assume they all had a certain minimum level of computing power available. I could just say "Bring your laptops on Thursday, we'll be doing corpus analysis!"
Next week I'll be starting a pretty ambitious project—50 Days of LIT Prompts. Every weekday for 10 weeks, I'll be sharing prompt patterns along with my thoughts and readings relating to Large Language Models like those behind #ChatGPT. Follow the link below, and this thread, for updates: https://sadlynothavocdinosaur.com/posts/50-days-of-lit-prompts/
What if you could select a word or phrase, click a button, and get a definition, be it for a word, idiom, or initialism, without leaving the page you're on? Well, I'm happy to say today's prompt template does just that.
It also starts us down the path of understanding how a neural net works!
My erstwhile colleagues working in #CorpusLinguistics would no doubt have something to say about this chart, but it does perhaps tell us something about the growing exhaustion with 'progress' when all around looks like sh*t...
We shouldn't read too much into such shifts in linguistic emphasis (it may be language use rather than material conditions driving the change), but equally it's unlikely to be nothing to do with the current & past global & national crises.
Question to the #fediverse experts out there: I'd like to host my #CorpusLinguistics and other educational video tutorials on a non-profit platform. What platform would you recommend (bearing in mind that I would have to cover any costs out of my own pocket)? #PeerTube #OpenEducation
My students and I were very lucky to have Dr. Luciana Forti give a guest lecture on "Data-Driven Learning for Italian L2 and beyond" in my seminar on Critical Language #DataLiteracy for the teaching and learning of Romance #languages yesterday. It must have been inspiring for more than just me because I've just had a student come to my office hour to discuss writing a term paper using the Perugia Corpus. ☺️ #CorpusLinguistics #DDL @corpuslinguistics
I am in a love-hate relationship with #DataScience.
Perhaps it wasn't wise to start with quantitative analysis of conversational speech using highly unreliable data as my first big project. #CorpusAnalysis #CorpusLinguistics
@corpuslinguistics For my seminar on corpus building and annotation from web-based data, I am looking for #CorpusLinguistics studies relying on web corpora, in particular of Facebook data, Instagram data, online news comments, podcasts, video tutorials, and online forum posts. Self-promotion welcome! 🙏
How linguists are unlocking the meanings of #Shakespeare's words using numbers
"Today it would seem odd to describe a flower with the word "bastard"—why apply a term of personal abuse to a flower? But in Shakespeare's time, "bastard" was a technical term describing certain plants."
If you’re curious about “The effect of the reference corpus on mean-frequency measures of lexical sophistication”, come and join @RaffaellaBottini and me for our LANA #CorpusLinguistics online talk.
Okay, here we go! Giving social media another try with #Mastodon since I started to miss the academic community that social media used to provide for me.
I'm Nele, a #linguistics postdoc at the University of Oslo, working on the language of fake news in English. I'm also affiliated with Lund University through my work on the London–Lund Corpus 2 and spoken language.
That's (mainly) what I'll be posting about. Here we go again!