#DataSets

Latest posts tagged with #DataSets on Bluesky

On the Importance of Pretraining Data Alignment for Atomic Property Prediction

Yasir M. Ghunaim, Hasan Abed Al Kader Hammoud, Bernard Ghanem

Action editor: Changyou Chen

https://openreview.net/forum?id=jfD9BsrDTb

#dataset #datasets #inception

But large #datasets bring challenges:
• Bias in digital data sources
• Measurement validity issues
• Risks of overfitting models

Therefore, validation and replication are essential in CSS research.

Executive summary of the report on Spanish datasets in Zenodo

The report on #datasets from Spanish universities in #Zenodo, with data from December 2025, is now published. More datasets, but a lower level of description. We must not let our guard down. University libraries ought to be doing something. www.javima.info/ciencia-abie...
#CienciaAbierta

Eye-Tracking-While-Reading Datasets

👀 📣 To all users of eye-tracking-while-reading datasets: check out our comprehensive, filterable dataset overview!

Dataset overview: dili-lab.github.io/datasets.html

Preprint: arxiv.org/abs/2602.19598

Add or edit your dataset: www.cl.uzh.ch/en/research-...

#FAIR #eyetracking #datasets

Scientists warn fake research is spreading faster than real science
A sweeping new study from Northwestern University reveals that scientific fraud is no longer just the work of a few rogue researchers; it has evolved into a global, organized enterprise. By analyzing…

"By analyzing massive #datasets .. #researchers uncovered networks involving “paper mills,” brokers, and compromised journals that systematically produce and sell fake #research, authorship slots, and #citations.": buff.ly/YJ4bqBU

via sciencedaily
#science #MedSky #research #ResearchJournals

Why Austria? A Prime Telemarketing Goldmine
In today's fast-paced digital landscape, businesses are constantly seeking efficient ways to connect with high-value leads. For marketers ta...

Enter 100% verified active #AustriaWhatsApp #numberdata from trusted #WhatsAppDatabase companies. These premium #datasets offer a #gamechanging solution for #telemarketing and direct call marketing #campaigns, delivering unmatched accuracy and ROI.
buywhatsappdatabase247.blogspot.com/2026/03/aust...

The scryptIQ #machinelearning module covers both supervised and unsupervised learning methods: namely the classification and clustering of different #biological #datasets, including images.

scryptiq.ai

Science is more than papers

153M+ research outputs in the #OpenAIREGraph are linked to #datasets & #software
A growing web of connections allowing us to see how knowledge is built across publications, data & code, not just the final paper.
Explore connections
🔗 #GraphAPI shorturl.at/oRotk
🔗 #OpenAIRE EXPLORE shorturl.at/RIZoh

New #J2C Certification:

Probabilistic Pretraining for Improved Neural Regression

Boris N. Oreshkin, Shiv Kumar Tavker, Dmitry Efimov

https://openreview.net/forum?id=F6BTATGXaf

#datasets #tabpfn #regression

BGS' BritPits map shows the distribution of worked mineral commodities across the UK - tinyurl.com/5ydmtaf6

#Aspermont #BritishGeologicalSurvey #BritPits #MineralResources #MineralPlanningAuthority #Geology #Datasets

"From Reflection to Repair: A Scoping Review of Dataset Documentation Tools" (new preprint via arXiv) arxiv.org/abs/2602.15968 #data #datasets #rdm

Discussing AI in the sphere of geological modelling with respect to the tunnelling industry - tinyurl.com/54bxc7bs

#Aspermont #COWIfonden #UniversityofStrathclyde #TechnicalUniversityofDenmark #COWI #AI #Tunnelling #GroundInvestigation #DataSets #GeologicalModelling

Automatic classification of research data sets into the Chinese Library Classification with generative large language model
Purpose. Research data sets are typically distributed across different data repositories and lack standardized classification information, which hinders effective discovery and access. This study aims...

How can AI classify multilingual research datasets?

doi.org/10.1108/EL-0...

Why read? It shows a practical pipeline using a fine-tuned Qwen2 to assign CLC codes to multilingual datasets.
Next step: More detailed cross-language evaluation (authors).

#ShortReview #AI #LLM #Classification #Datasets

Industry holds some of the richest #ocean #datasets — yet only 3% reach global #biodiversity repositories (Tides of Transparency, 2024).
📺 Ocean Literacy Webinar 2
🗓️ 17 March 2026 | Online

Register now on our website! 🔗 tinyurl.com/3993rj9t

#agentarium
#intelligence_module
#cognitive_infrastructure
#vdb
#ai
#data
#datasets
#agenticai
#rag
#graphrag

Occam’s Razor for SSL: Memory-Efficient Parametric Instance Discrimination

Eric Gan, Patrik Reizinger, Alice Bizeul et al.

Action editor: Georgios Leontidis

https://openreview.net/forum?id=GFNTbsVFlP

#supervised #regularization #datasets

1) Do #datasets have #DOIs? How are #data cited?

"At Pensoft we can do it in 2 ways: authors can cite both Data Papers and/or #Dataset. We recommend to cite both, and this is in our opinion the right way to do that" - Prof. Penev.

#lovedata26

@lovedataweek.bsky.social

AllenAI Introduces #AutoDiscovery: Automated Scientific Discovery Now Available in Asta Labs allenai.org/blog/autodis... #AI #datasets #data @ai2.bsky.social #research

List of Ethical Requirements for the study "Co-Design of a Trustworthy AI-based Prognostic Tool for Predicting Patient Outcome in Acute Stroke"
The data was collected as part of the study “Co-Design of a Trustworthy AI-based Prognostic Tool for Predicting Patient Outcome in Acute Stroke.” It includes ethical requirements and the associated d...

List of Ethical Requirements for the study "Co-Design of a Trustworthy AI-based Prognostic Tool for Predicting Patient Outcome in Acute Stroke" zenodo.org/records/1848... #hvhebron #datasets #neuro [Full text]

Digitalisierung als Chance für Frauen in MINT (digiMINT)
Jeanrenaud, Yves; Wimmer, Anna-Kathrin; Bässler, Katharina: Digitalisierung als Chance für Frauen in MINT (digiMINT) [dataset]. Qualiservice, PANGAEA, https://doi.pangaea.de/10.1594/PANGAEA.990048 (da...

I pre-registered our #qualitative #datasets: Jeanrenaud et al. (in review): Digitalisierung als Chance für Frauen in MINT (digiMINT) doi.pangaea.de/10.1594/PANG...

Webinars | This morning, we hosted a webinar on the HiQLCD database version 1.4.0, together with the HiQLCD team.

If you missed it, the recording is available on our YouTube channel: www.youtube.com/watch?v=73jg...

#openLCA #webinar #database #chinese #update #datasets #lifecycleassessment

Real Estate Data Explained - Examples, Datasets & Top Providers

The real estate market is expected to grow at an annual rate of 2.69% (CAGR 2025-2029).

Learn about real estate data and top providers: www.hitechbpo.com/blog/real-es...

#realestatedata #datasets #realestatedataprovider

Creative Dataset Maker
CreativeDatasetMaker – AI-Powered Dataset Generation
Need high-quality datasets but don't have the time or resources to create them manually? Say hello to CreativeDatasetMaker, the AI-powered tool that...

Do you want to create synthetic datasets for your AI projects? Try Creative Dataset Maker. It lets you use OpenAI LLMs to generate realistic datasets.

#AI #datasets #LLM

zerooneeta.gumroad.com/l/hhlwt

Resilience in Times of Crisis: Strengthening Open Science Against Geopolitical Pressures (via @leidenmadtrics.bsky.social) www.leidenmadtrics.nl/articles/res... #openscience #datasets @datarescueproject.org

On a Related Note...
#Guidelines and Best Practices For Making Government #Datasets Ready For #AI (via Gov of UK) www.gov.uk/government/p... #bestpractices #data

ICYMI UPDATE on the UK Government Effort to Create a National #Data Library www.gov.uk/government/p... #datasets

SPONGE: Competing Sparse Language Representations for Effective Knowledge Transfer

Jens-Michalis Papaioannou, Alexei Figueroa, Conor Fallon et al.

Action editor: Changjian Shui

https://openreview.net/forum?id=OevFdPgk3h

#nlp #annotated #datasets

FetchSeries - Economic and Financial Data

FetchSeries - www.fetchseries.com
Freely Downloadable Data Sets Updated Daily

#Datasets #Commodities #Transport #EnvironmentAndClimate #Finance #Macroeconomics

nycOpenData: A unified R interface to NYC Open Data APIs
Discover nycOpenData, an R package offering tidy, reproducible access to NYC Open Data APIs for teaching, research, and civic data analysis.

#OpenData #datasets #NYC #R

'I am pleased to announce the release of nycOpenData, an R package providing convenient, tidy access to dozens of datasets from the New York City Open Data platform.'

statsandr.com/blog/nycopen...

Who is mapping Trump’s Gulag prison system in the USA and elsewhere?

#Datasets #dataviz #queries
