Santiago Castro

@bryant1410

🇺🇾 Research Scientist @Netflix, working on Vision+Language research. Opinions are my own.

213
Followers
531
Following
1
Posts
10.08.2023
Joined

Latest posts by Santiago Castro @bryant1410

https://tinyurl.com/BristolCVLectureship

Pls RT
Permanent Assistant Professor (Lecturer) position in Computer Vision @bristoluni.bsky.social [DL 6 Jan 2025]
This is a research+teaching permanent post within MaVi group uob-mavi.github.io in Computer Science. Suitable for strong postdocs or exceptional PhD graduates.
t.co/k7sRRyfx9o
1/2

04.12.2024 17:22 👍 22 🔁 14 💬 1 📌 1
A screenshot showing the usage quota of my repositories storage, which is 5x more than full.

HuggingFace is limiting repositories' storage 😱

02.12.2024 19:21 👍 4 🔁 0 💬 0 📌 1

Can all graduate programs please accept a universal letter system like Interfolio so we don’t have to upload 100 letters individually?! The time waste is insane.

Students are telling me that only *two* of their applications accept Interfolio!

01.12.2024 19:20 👍 4 🔁 2 💬 0 📌 0

A librarian that previously worked at the British Library created a relatively small dataset of bsky posts, hundreds of times smaller than previous researchers, to help folks create toxicity filters and stuff.

So people bullied him & posted death threats.

He took it down.

Nice one, folks.

28.11.2024 05:33 👍 583 🔁 59 💬 28 📌 11

Personally, reviewing for NeurIPS a couple years back changed me as a reviewer. For one paper I rejected, I kept citing it throughout the year to people for a finding it had. This made me realise it was a good paper, it just had some easy targets for rejection.

27.11.2024 17:25 👍 67 🔁 8 💬 2 📌 1

Do you know what rating you’ll give after reading the intro? Are your confidence scores 4 or higher? Do you not respond in rebuttal phases? Are you worried how it will look if your rating is the only 8 among 3’s? This thread is for you.

27.11.2024 17:25 👍 77 🔁 20 💬 4 📌 3
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
We present CAT4D, a method for creating 4D (dynamic 3D) scenes from monocular video. CAT4D leverages a multi-view video diffusion model trained on a diverse combination of datasets to enable novel vie...

We just dropped CAT4D, text to dynamic 3D models that you can render in real time. Not posting a video because Bluesky is garbage in this respect; go straight to the real time viewer on a desktop browser and look around. The cat kneading dough is my favorite.
cat-4d.github.io

28.11.2024 02:50 👍 114 🔁 11 💬 3 📌 1
Post image
28.11.2024 01:49 👍 3 🔁 1 💬 0 📌 0

In the HuggingFace/Bluesky incident, the problem goes deeper than whether the data is "public" or "private"

What matters to people is whether their data was collected, which data was collected, how it may be used, and who it may be used by

27.11.2024 14:57 👍 118 🔁 27 💬 3 📌 8

ACL syntax track reviewers >> almost any other conference.

These folks care about their sub-field and i learn something new every time!

27.11.2024 19:44 👍 12 🔁 2 💬 1 📌 1

On July 26th, Nancy Pelosi sells 5000 shares of $MSFT Microsoft,

On Nov 27, 2024, the FTC announces it is launching a wide-ranging US antitrust probe against $MSFT.

This was Nancy Pelosi's largest sell in two years of her portfolio, with $MSFT below her sell now.

27.11.2024 23:44 👍 194 🔁 26 💬 20 📌 3

We are looking for the current best multi-view full-body 3d pose estimation model/software with Remi Cadene

Any good advice?

Should include hands pose estimation in addition to body preferably

Better if able to use multiple cameras as inputs (multi-view)

for open-source low cost robot teleop

27.11.2024 22:17 👍 20 🔁 3 💬 4 📌 0
I'm standing next to the poster holding a dice tray with a blue dice inside

Today I presented my MSc. work "Exploring approaches to Improvisational Interactive Storytelling" in the student seminar.

I narrated a basic setting and used a dice to explain the gamemastering mechanisms to the committee ☺️

OK, now I have to write my thesis! 😅

27.11.2024 16:31 👍 5 🔁 2 💬 0 📌 0

Check out our new work on video-guided audio gen with a focus on fine-grained creative control! Done by @czyang.bsky.social during an internship with our group at Adobe Research. Super fun model!

27.11.2024 03:00 👍 10 🔁 2 💬 0 📌 0

🎥 Introducing MultiFoley, a video-aware audio generation method with multimodal controls! 🔊
We can
⌨️ Make a typewriter sound like a piano 🎹
🐱 Make a cat meow like a lion roars! 🦁
⏱️ Perfectly time existing SFX 💥 to a video.

arXiv: arxiv.org/abs/2411.17698
website: ificl.github.io/MultiFoley/

27.11.2024 02:58 👍 42 🔁 12 💬 2 📌 6

Rare personal tweet:
Subletting our furnished apartment in Brooklyn for the spring at a significant discount. It's quite nice and in a fun location. Under price. Email me if you are interested, and I will send pictures.

25.11.2024 20:39 👍 25 🔁 7 💬 0 📌 0

The FATE group at @msftresearch.bsky.social NYC is accepting applications for 2025 interns. 🥳🎉

For full consideration, apply by 12/18.

jobs.careers.microsoft.com/global/en/jo...

Interested in AI evaluation? Apply for the STAC internship too!

jobs.careers.microsoft.com/global/en/jo...

25.11.2024 13:31 👍 73 🔁 35 💬 4 📌 1

If you want to help improve peer review, we are looking for a new Co-CTO for ACL Rolling Review!

Requirements:
- Post-PhD
- Experienced with Python (including command line use)
- Time commitment of 3 hours a week on average (but note that you are not expected to review while serving)

Contact me!

24.11.2024 19:36 👍 7 🔁 5 💬 0 📌 0

🌍✨ Announcing the 4th edition of the NLP for Positive Impact workshop at #ACL2025 in Vienna!
Come join us and explore various social applications of NLP!
📢 Call for papers & more details coming soon!
🔗 https://sites.google.com/view/nlp4positiveimpact/acl-2025-workshop

21.11.2024 20:51 👍 25 🔁 7 💬 1 📌 1
WhatsApp will soon transcribe your voice messages
Finally, an easy way to skim through lengthy voice clips.

WhatsApp will soon transcribe your voice messages

21.11.2024 17:10 👍 87 🔁 12 💬 10 📌 10

If you're interested in embeddings and SQLite you should be paying attention to sqlite-vec

Lots of neat stuff in this release - and the blog post provides a very clear explanation of what it can do

20.11.2024 17:49 👍 92 🔁 9 💬 2 📌 1