#evaluation - kbin.social

LChoshen, 26 days ago to llm

Do LLMs learn foundational concepts required to build world models? (less than expected)

We address this question with 🌐🐨EWoK (Elements of World Knowledge)🐨🌐

a flexible cognition-inspired framework to test knowledge across physical and social domains

https://ewok-core.github.io

#llm #llms #evaluation #ml #machinelearning

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

vohwinkel, 3 months ago to Wuppertal German

Die Stadt Wuppertal möchte wissen, wie es mit der Bürgerbeteiligung läuft...
https://wupper.link/n4mse

#Wuppertal #Vohwinkel #StadtWuppertal #Stadtverwaltung #Umfrage #Evaluation #Talbeteiligung #Bürgerbeteiligung @wuppertal @wuppertal

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ caos

Sousse, 4 months ago to tunisia French

La #BAD lance la solution informatique “RASME“ en #Tunisie
La #Banque #africaine de #développement (BAD) lance en Tunisie la solution informatique « RASME de collecte, d’analyse et de traitement de données en temps réel » en vue de renforcer la supervision des projets de développement.

https://www.leconomistemaghrebin.com/2024/02/10/bad-lance-solution-informatique-rasme-tunisie/

#Tunisia #Geo-Enabling #géolocalisation #KoBoToolbox #evaluation #Africa #Afrique #Investissement #Investment

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ParisWriters, 6 months ago to Letters

#Harvard receives another devastating letter from Bill Ackman, detailing the reasons why the university should fire President, Claudine Gay

(‘ In her short tenure as President, Claudine Gay has done more damage to the reputation of Harvard University than any individual in our nearly 500-year history’)

It is a very good letter.

https://twitter.com/BillAckman/status/1733985787455168906

#highered #letters #DEI #antisemitism #reputation #university #scholarship #discrimination #hiring #bestpractices #evaluation

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

nicolay_lilicre, 6 months ago to random French

Je vais faire une #évaluation en travail de #groupe en #6e

Les #rôles seront attitrés et distribués sous forme de carte (cf image jointe).

Ils changeront de rôle toutes les 10 minutes.

J'explique un peu comment va se passer l'évaluation (sans montrer les cartes ^^).

Une élève : Monsieur, c'est quand l'évaluation ?
Moi : Lundi, la prochaine fois qu'on se voit.
La classe : Trop bien !

Trop hâte 😃

J'ai mis le pdf et le odp sur #LaForge , à retrouver ici :
https://forge.aeif.fr/ciaconelli/lmdbt.fr-la-texiotheque/-/tree/main/Divers

:cc: :ccby:

Photos des cartes imprimées et découpées sur des feuilles cartonnées de couleurs pastelles.

reply

expand (12)

collapse (12)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ scudery, LegalizeBrain

nicolay_lilicre, 6 months ago

Bonsoir à tous ! 👋

📝 J'ai rédigé un billet détaillant cette récente #activité pédagogique 👨‍🏫 en classe de #6e :
une forme d'#évaluation en groupe basée sur des #rôles dynamiques pour stimuler la #collaboration et l'#autonomie.

J'y détaille les modalités de "mise en place" et de "règlement".

🔗 Sur #LaForge dans #LaTeXiotheque (faute de mieux pour l'instant 🙃)⤵️
https://ciaconelli.forge.aeif.fr/lmdbt.fr-la-texiotheque/eval_groupe_regles.html

#TeamMaths #TeamProfs #cycle3 #TeamPE

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ lelibreedu

Va, 6 months ago to random French

#TeamPE #Teamprof #evaluation
Je suis vraiment une drôle d'instit. Les résultats de mes CM1 que j'avais déjà au CE2 sont à l'inverse des résultats académiques. Mais alors totalement.

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

Dorianix, 6 months ago to random German

Kann es wirklich sein, dass das #Kolleg für #Elementpädagogik #Augustinum in #Graz keine #Evaluierung von #Lehrveranstaltungen und deren Vortragenden macht?

Weiß dazu jemand was?
Ist das normal?

#Bildungseinrichtung #fail

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

linos, 6 months ago

@Dorianix @publicvoit Oft wird vielleicht auch nur zyklisch alle paar Jahre evaluiert. Und an der #KUG haben manche Vorlesungen oder Übungen es geschafft mehr als ein Jahrzehnt ohne aus zu kommen, weil sie leider immer dann, wenn das Institut evaluiert wurde, nicht angeboten wurden. #Evaluation

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

academia_carnet, 9 months ago to random French

#Academiaévalue 🧐 7 septembre 2023
#VeilleESR

@adirlabos fait le bilan de la campagne @Hceres_ et
"s’interroge sur les finalités d’un tel processus bureaucratique d’#évaluation et ne comprend pas que le coût de celui-ci ne soit jamais lui-même évalué"

https://academia.hypotheses.org/51718

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Gouximan, ma_delsuc

steamworkgroup, 9 months ago to random

Solid opportunity to work with NEA staff on arts program evaluation. Despite the limiting name, it covers all aspects of evaluation, not just analysis. More info is at the link.

Position is in Washington DC.

Salary $94-145k

Closing date Sept 6

https://www.usajobs.gov/job/744614800

#jobs #evaluation #arts

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cynblogger, Binder

Private

steamworkgroup, 10 months ago

@lisseuse @museum @classicalmusic My potential questions about Orff depend on who I am - the roles, identities, and motivations I'm bringing to my visit. Nevermind the mission of the museum and the intended audiences. (Sorry for the evaluator answer of "it depends," yet it's an honest one!)

#museum #evaluation #CulturalEducation #music

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stephaniewalter, 11 months ago to accessibility

Top Tips from a Web Accessibility Evaluator at WebAIM: https://webaim.org/blog/top-evaluation-tips/
Great pieces of advice to help you evaluate the accessibility of a website: evaluate piece by piece (header, main regions, footer), search for patterns, create your own checklist to help, etc.
#Accessibility #Evaluation

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

gerald_leppert, 11 months ago to random

#introduction

I am economist & political scientist (PhD) and conduct research on #SocialRiskManagement #ClimateChangeAdaptation #SocialProtection #PublicHealth #GlobalDevelopment #CooperativeStudies.

My current focus is on the topics #ClimateChange #Adaptation, #DisasterRiskManagement, #RuralDevelopment, #HealthCareFinancing, #Multiculturalism.

#ResearchMethods: #MixedMethods & #MethodIntegration, #Quantitative & #Qualitative, #RigorousImpactAssessment, #Causality, #Econometrics, #Evaluation.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Vittoria, skye

ThunderHoney, 11 months ago to philosophy

Post submission snooze. Thesis is out in the world for evaluation. No editing or writing for a while. Time for some much needed downtime.
#PhDLife #Thesis #Submitted #Writing #Evaluation #Dog #DogsOfMastodon #Nap #PuppyLove #WritingBuddy #Rest #AmEditing #AmWriting #RestAndRecovery #SleepyPuppy

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ riversidebryan, RiversideBryan

ThomasRhysEvans, 11 months ago to psychology

Just finished ‘Evaluating What Works’ by @deevybee & Paul Thompson. A really good primer on how to evaluate #interventions and, despite the speech and language therapy context, I found it highly relevant to #psychology & #metaresearch.

I’ll be adding this to my module’s reading list, but unfortunately I can’t put it on my Goodreads :ablobcatcry:

#evaluation #researchmethods #evidence #openscience

Read online for free:
https://bookdown.org/dorothy_bishop/Evaluating_What_Works/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ lakens

ErikJonker, 1 year ago to ai

Suppose i would like to measure the amount of bias, discrimination, hallucination etc in tools like Bard, Bing, ChatGPT and others. Are there already standards and tools to measure that ?
There will be discussions whether model A is better/worse then model B, it would be nice to have some standards/benchmarks for evaluation ? 🤔
#AI #GenerativeAI #Evaluation #LLM

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

VanessaEr, 1 year ago to food

#introduction
Hello, I’m a #PublicHealth and #nutrition researcher, an #AcademicMum, and a #maker. #Malaysian living in the #UK. Public library champion. Fan of #DIY shows. #Decluttering and #crafting in spare time (if any!)

Research interests include #EthnicMinorityHealth, #evaluation, #CommunityHealth.

I will mainly be tooting about #equity, #decolonisation, #FoodSystem, #Women’sFootball, #ComplexSystems, #ParticipatoryResearch, and #CDoH

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...