Discovered this morning that Maven heymaven.com (a social media startup whose CEO is ex-OpenAI: "Ken Stanley: leading the Open-Endedness Team at OpenAI") is mass-importing public posts from the #fediverse with no links back to the original and no way to delete them. It seems there is no opt-out or opt-in mechanism at all. It also has posts from #Bluesky pulled in via @bsky.brid.gy that are also not linked back to the original.
Here's an example: app.heymaven.com/profile/66927
reshared this
wakest ⁂
in reply to wakest ⁂ • • •
John
in reply to wakest ⁂ • • •
cyplo
in reply to wakest ⁂ • • •
wakest ⁂
in reply to wakest ⁂ • • •
wakest ⁂
Unknown parent • • •
damon
in reply to wakest ⁂ • • •
The VHS Wizard 🦝📼🧙
in reply to wakest ⁂ • • •The stupidest part of this is... I got curious and was messing around with the "tags" and noticed that instead of using the hashtag's name, it gives everything a number.
For instance, Peertube Instance is app.heymaven.com/tag/154847
So I was like... hrrm, I wonder what the very first tags are.
We have the following:
1 - AI
2 - ChatGPT
3 - Startups
4 - fundraising
5 - tech
Maven
app.heymaven.com
wakest ⁂
in reply to The VHS Wizard 🦝📼🧙 • • •
wakest ⁂
Unknown parent • • •
wakest ⁂
in reply to wakest ⁂ • • •1.12 million fediverse posts scraped by AI startup Maven founded by ex OpenAI lead...
confirmation by Maven CTO Jimmy Secretan app.heymaven.com/discover/1190…
Maven
app.heymaven.com
reshared this
𝚜𝚎𝚕𝚎𝚊, wakest ⁂ and Tim Chambers reshared this.
DJ Sundog
in reply to wakest ⁂ • • •hate it.
for the record, I emailed jimmy@heymaven.com when I saw your post and checked out their T&Cs. I informed him that he was violating my content licensing by scraping the toot-lab and gave him a reference link to my shadow profile on their service, and that if they persisted in misusing my posts I'd have to look at legal remedies, and he just replied and said he has "removed the data and will work this week to prevent future ingestion. Thanks and sorry for the inconvenience."
so, super annoying and mega-manual opt-out process, but the profile page pretending to be me is indeed now removed.
wakest ⁂ reshared this.
the most validated enby
in reply to DJ Sundog • • •
DJ Sundog
in reply to the most validated enby • • •
the most validated enby
in reply to DJ Sundog • • •
The Gibson 🅅
in reply to DJ Sundog • • •We have seen at least one DM show up in their search... beware.
wakest ⁂
in reply to The Gibson 🅅 • • •
Tunera Type Foundry
in reply to wakest ⁂ • • •
feld
in reply to wakest ⁂ • • •
shadowwwind
in reply to wakest ⁂ • • •app.heymaven.com/discover/9787…
Maven
app.heymaven.com
wakest ⁂
in reply to shadowwwind • • •
shadowwwind
in reply to wakest ⁂ • • •
RavenCode
in reply to wakest ⁂ • • •
wakest ⁂
in reply to wakest ⁂ • • •So the CTO is here at @jsecretan and has clarified that they are in the process of implementing bidirectional #ActivityPub, but in the meantime ingested the "federated timeline" of Mastodon.social
You can look at their AP response here: staging.maven.ly/mastodon/acto… though it doesn't seem to be live on their main domain.
Tim Chambers reshared this.
Oblomov
in reply to wakest ⁂ • • •
wakest ⁂
in reply to wakest ⁂ • • •
Tim Chambers reshared this.
RealGene ☣️
in reply to wakest ⁂ • • •@jsecretan
Maybe this is naive, but once an LLM has "ingested" source material, what remains is a bunch of statistics; the "source" is no longer required or stored.
Trying to "remove" it from a model sounds a lot like trying to unbake a cake.
KonsolentieR
in reply to wakest ⁂ • • •I'm not confused. I'm pretty sure 🇪🇺 and 🇩🇪 laws were, are and will be violated.
I am now preparing my request for erasure in accordance with Art. 17 GDPR. How to contact the data protection officer? Is there a data protection officer?
Tim Chambers
in reply to wakest ⁂ • • •
wakest ⁂
in reply to Tim Chambers • • •
wakest ⁂
in reply to wakest ⁂ • • •UPDATE 3: CTO Jimmy (@jsecretan) says "We have paused everything related to our Fediverse ingestion for now and we are removing everything ingested. To be honest, the extreme negative reaction was a surprise to me, as I thought interaction between disparate systems was the entire point, but clearly we didn't navigate the culture correctly." - app.heymaven.com/discover/1190…
And @deadsuperhero wrote an article mostly from this thread for @wedistribute.org now live at wedistribute.org/2024/06/maven…
Maven
app.heymaven.com
We Distribute
2024-06-12 20:26:10
@pettter@social.accum.se
in reply to wakest ⁂ • • •
Parade du Grotesque 💀
in reply to wakest ⁂ • • •@jsecretan @deadsuperhero @wedistribute.org
Sorry, but... "To be honest, the extreme negative reaction was a surprise to me, as I thought interaction between disparate systems was the entire point, but clearly we didn't navigate the culture correctly."
YOU THINK?!?! 🤣
The VHS Wizard 🦝📼🧙
in reply to wakest ⁂ • • •@jsecretan @deadsuperhero @wedistribute.org
I don't even think it's a question of "Fediverse Culture". I personally would honestly be fine with them importing all of the Fediverse to their thing - IF there were back-links so you could, like, interact with the author, see context and follow-ups, all that fun stuff.
What he did was basically like cropping out an artist's signature before uploading it to 9gag as your own - just to the entire Fediverse.
wakest ⁂
in reply to The VHS Wizard 🦝📼🧙 • • •
D3
in reply to wakest ⁂ • • •
wakest ⁂
in reply to D3 • • •
Zalasur 🇺🇦
in reply to wakest ⁂ • • •@jsecretan @deadsuperhero @wedistribute.org
I'm pretty sure they feel very bad about the whole development*
* (getting caught)
quixote
in reply to wakest ⁂ • • •@jsecretan @deadsuperhero What an utter ass. "We have paused ... ingestion for now [for his pathetic AI]"
" I thought interaction between disparate systems was the entire point"
What part of "interaction" does this turkey not understand? Get a dictionary of synonyms. "Ripoff" is not one of them.
Luna Lactea
in reply to wakest ⁂ • • •
nonlinear
in reply to wakest ⁂ • • •
Susan Kaye Quinn 🌱(she/her)
in reply to wakest ⁂ • • •the gaslighting continues
Jeff Sikes
in reply to wakest ⁂ • • •
wakest ⁂
in reply to Jeff Sikes • • •
lachlan slowly taming rust
in reply to wakest ⁂ • • •
Talesto
in reply to wakest ⁂ • • •Fediverse doesn't Google properly.
But this way all our posts become part of the Universe.
Forever)
The 500 Hats of LambdaCalculus
in reply to wakest ⁂ • • •
Pseudo Nym
in reply to wakest ⁂ • • •
wakest ⁂
Unknown parent • • •
wakest ⁂
Unknown parent • • •
cryptix
in reply to wakest ⁂ • • •Beschwerdeformular - Start
kontakt.datenschutz-berlin.de
BonnettsBooks
in reply to wakest ⁂ • • •Thank you for sharing.
I found my shadow profile on maven, too, spanning 5/17/24 - 6/8/24, but not every one.
They've stripped hashtags from the bottom of my posts. Image AltText seems to be missing or inaccessible there. And, they add their own imprecise tags.
I wonder if hashtags in the body of a post would stop them, get stripped – bastardizing the content, or simply be ignored? What about Emojis? I'll throw a hashtag into today's post and see if it turns up there in a few days.
Cairo Braga [toot]
in reply to wakest ⁂ • • •
mem_somerville
in reply to wakest ⁂ • • •
wakest ⁂
Unknown parent • • •
Jennifer
in reply to wakest ⁂ • • •
hömma
in reply to wakest ⁂ • • •tagging instance admins
@orga
please read whole thread 🙏
(((JaneinNJ)))
in reply to wakest ⁂ • • •This sounds bad but as a non-technical person, I don’t understand much of it. Questions:
1. Is it likely that everyone here has had data scraped?
2. Can we protect ourselves?
3. If so, how?
I am sure there are more questions but these come to mind immediately.
Thanks.
wakest ⁂
in reply to (((JaneinNJ))) • • •
(((JaneinNJ)))
in reply to wakest ⁂ • • •
wakest ⁂
in reply to (((JaneinNJ))) • • •Issues · jsecretan/maven-public
GitHub
(((JaneinNJ)))
in reply to wakest ⁂ • • •
Sarah W
in reply to wakest ⁂ • • •
wakest ⁂
in reply to Sarah W • • •
Sarah W
in reply to wakest ⁂ • • •
Cykonot
in reply to wakest ⁂ • • •
bumblefudge
in reply to wakest ⁂ • • •TDM·AI Protocol | TDM·AI
docs.tdmai.org
Ivan
in reply to wakest ⁂ • • •
OpticalNail 🇵🇸
in reply to wakest ⁂ • • •@jippi Any thoughts about this?
@liaizon
Jippi 🇩🇰
in reply to OpticalNail 🇵🇸 • • •
OpticalNail 🇵🇸
in reply to Jippi 🇩🇰 • • •@jippi
I did see some mentions of DMs leaking in this thread, although I have not looked into it further and I'm not sure what that's about.
If it's only public posts, then it makes sense.
Joël 🍵
in reply to wakest ⁂ • • •Good news, the ingestion is stopped and deletion is in progress. Apparently they did not expect the negative feedback.
Jimmy Secretan - 20 minutes ago
We have paused everything related to our Fediverse ingestion for now and we are removing everything ingested. To be honest, the extreme negative reaction was a surprise to me, as I thought interaction between disparate systems was the entire point, but clearly we didn't navigate the culture correctly.
app.heymaven.com/discover/1190…
Maven
app.heymaven.com
fear is not a Weltanschauung
in reply to wakest ⁂ • • •
The VHS Wizard 🦝📼🧙
in reply to wakest ⁂ • • •It gets dumber (again again)
This isn't even the first time they've tried something like this. Here's a post from months ago where someone notices that "new" Maven has 3-year-old posts, and they confirmed down in the comments that they imported a bunch of "high quality sources" and set them to different dates to "spark discussion".
app.heymaven.com/discover/2689…
Maven
app.heymaven.com
wakest ⁂
in reply to The VHS Wizard 🦝📼🧙 • • •
binchicken
in reply to wakest ⁂ • • •
Daniel Arrazola
in reply to wakest ⁂ • • •
LouD
in reply to wakest ⁂ • • •@bsky.brid.gy @aral
webhat
in reply to wakest ⁂ • • •@aral
The CTO is formerly the VP of Ads and Premium Services at Brave, that checks out
Aral Balkan
in reply to webhat • • •
Elena
in reply to wakest ⁂ • • •
ricodegrumpy
in reply to wakest ⁂ • • •