Just reported by Mohamed Aruham on Twitter
The oldest tweets I could find that actually started reporting this are from ~16 days ago.
https://x.com/Piotrdotcom/status/1829126494574067992
They reference a page here that was posted on Aug 29th.
Just reported by Mohamed Aruham on Twitter
The oldest tweets I could find that actually started reporting this are from ~16 days ago.
https://x.com/Piotrdotcom/status/1829126494574067992
They reference a page here that was posted on Aug 29th.
Well, I don’t think anyone is really sure just how exactly time travel can mess with stuff. I would probably take a page from some time travel movie I saw. I would want to avoid any sort of temporal paradox and avoid too many changes.
So, I would probably remove myself from the equation as much as possible. Go to a hotel or somewhere where I can avoid accidentally running into anyone I might know. Leave all electronics behind, but take a book or something. Spend the whole day avoiding TV/News/People and just reading or work on perfecting a skill. At the end of the day I would call up a broker from the hotel room and find out which stock had the greatest percentage gain that day. Just enough information for one good trade.
Then I would go back to that morning, buy up a ton of that stock, live out life normally, then sell the stock at the end of the day.
Rinse and repeat for a short time.
I would absolutely avoid something like winning the lottery, but for those of you who would use time travel to win the lottery, you might want to follow the advice from this comment here: https://old.reddit.com/r/AskReddit/comments/24vo34/whats_the_happiest_5word_sentence_you_could_hear/chb38xf/
I just want to be able to set alarms with their calendar app (where it currently only sends notifications).
Ok, but the most important part of that research paper is published on the github repository, which explains how to provide audio data and text data to recreate any STT model in the same way that they have done.
See the “Approach” section of the github repository: https://github.com/openai/whisper?tab=readme-ov-file#approach
And the Traning Data section of their github: https://github.com/openai/whisper/blob/main/model-card.md#training-data
With this you don’t really need to use the paper hosted on arxiv, you have enough information on how to train/modify the model.
There are guides on how to Finetune the model yourself: https://huggingface.co/blog/fine-tune-whisper
Which, from what I understand on the link to the OSAID, is exactly what they are asking for. The ability to retrain/finetune a model fits this definition very well:
The preferred form of making modifications to a machine-learning system is:
- Data information […]
- Code […]
- Weights […]
All 3 of those have been provided.
I don’t understand. What’s missing from the code, model, and weights provided to make this “open source” by the definition of your first link? it seems to meet all of those requirements.
As for the OSAID, the exact training dataset is not required, per your quote, they just need to provide enough information that someone else could train the model using a “similar dataset”.
I did a quick check on the license for Whisper:
Whisper’s code and model weights are released under the MIT License. See LICENSE for further details.
So that definitely meets the Open Source Definition on your first link.
And it looks like it also meets the definition of open source as per your second link.
Additional WER/CER metrics corresponding to the other models and datasets can be found in Appendix D.1, D.2, and D.4 of the paper, as well as the BLEU (Bilingual Evaluation Understudy) scores for translation in Appendix D.3.
The STT (speech to text) model that they created is open source (Whisper) as well as a few others:
I initially think this same thing every time I see someone mention MTG on here, glad I’m not the only one.
I don’t think this is specifically an “AI” problem as much as it’s a privacy issue with the way companies are buying and selling our info for targeted advertising. These models are definitely enabling them to do more with the data that they have as well as to collect more information from us in new ways.
Yeah, the other thing I could see happening is a similar tactic used by scammers where they use Mules who pick up mail from various Airbnbs throughout whatever country, but this would definitely limit most bot operations… Unless some organization specializes in this and just offers some service to create a bunch of accounts for anyone willing to pay.
Also, how many accounts would you limit to a single address, and how long would you lock up an address before it could be used again (given that people do move around from time to time).
edit:typo.
That’s a good point. I didn’t know about the USPS Form 1583 for virtual mailboxes… Although that is a U.S. specific thing, so finding a similar service in a country that doesn’t care so much might be the way to go about that.
Yep, exactly this. It might deter some small time bot creators, but it won’t stop larger operations and may even help them to seem more legitimate.
If anything, my favorite idea comes from this xkcd:
Easy way to get around that with “virtual” addresses: https://ipostal1.com/virtual-address.php
Just pay $10 for every account that you want to create… you may as well just go with the solution of charging everyone $10 to create an account. At least that way the instance owner is getting supported and it would have the same effect.
“Just download our app on the Microsoft Store/App Store!” /s
Yeah, a decision to modify copyright so that it affects training data as well would devastate open source models and set us back a bit.
There are many that want to push LLMs back, especially journalists, so seeing articles like this are to be expected.
edit: a word.
All ice cream (and related desserts) will get harder as they get colder.
It feels like you’re comparing ice cream/desserts that are completely frozen to ice cream/desserts that are partially frozen, which is not what this post is about…
Although if the ice cream does get slightly liquidy before re-freezing, it will be much harder than it was before. This is why one of the most important factors when making ice cream is to continually mix up the ice cream while it freezes.
That’s a big misconception with what quantum internet is (and what quantum entanglement actually allows for) as explained by this physicist: https://www.youtube.com/watch?v=u-j8nGvYMA8
Quantum Internet doesn’t mean that you can transmit data faster than the speed of light.
Quantum Internet just means you get an ultra secure connection, but it’s super susceptible to noise (in other words, you can’t send a lot of data reliably and it would be terrible for that).
At best this would be useful for being absolutely sure that some encryption keys were sent successfully without being intercepted by anyone else.
“Stealth” is a useful tool for visiting Reddit. It’s available on F-Droid. https://gitlab.com/cosmosapps/stealth
Otherwise using old.reddit with a browser has been a decent fallback.
Best exchange I heard about this topic:
Person 1: “Stop being such a smart ass”
Person 2: “I will when you stop being such a dumb ass”
Probably “Trap Adventure 2”.
Imagine an old Mario game where Bowser has the most rediculous traps set up. You need to memorize all of the trap locations as well as have the coordination to tip-toe around them to survive.
https://www.youtube.com/watch?v=_nW9k6k1I3k