this post was submitted on 04 Sep 2024
65 points (90.1% liked)

Ask Lemmy

27027 readers
671 users here now

A Fediverse community for open-ended, thought provoking questions

Please don't post about US Politics. If you need to do this, try !politicaldiscussion@lemmy.world


Rules: (interactive)


1) Be nice and; have funDoxxing, trolling, sealioning, racism, and toxicity are not welcomed in AskLemmy. Remember what your mother said: if you can't say something nice, don't say anything at all. In addition, the site-wide Lemmy.world terms of service also apply here. Please familiarize yourself with them


2) All posts must end with a '?'This is sort of like Jeopardy. Please phrase all post titles in the form of a proper question ending with ?


3) No spamPlease do not flood the community with nonsense. Actual suspected spammers will be banned on site. No astroturfing.


4) NSFW is okay, within reasonJust remember to tag posts with either a content warning or a [NSFW] tag. Overtly sexual posts are not allowed, please direct them to either !asklemmyafterdark@lemmy.world or !asklemmynsfw@lemmynsfw.com. NSFW comments should be restricted to posts tagged [NSFW].


5) This is not a support community.
It is not a place for 'how do I?', type questions. If you have any questions regarding the site itself or would like to report a community, please direct them to Lemmy.world Support or email info@lemmy.world. For other questions check our partnered communities list, or use the search function.


Reminder: The terms of service apply here too.

Partnered Communities:

Tech Support

No Stupid Questions

You Should Know

Reddit

Jokes

Ask Ouija


Logo design credit goes to: tubbadu


founded 1 year ago
MODERATORS
 

Obviously there's not a lot of love for OpenAI and other corporate API generative AI here, but how does the community feel about self hosted models? Especially stuff like the Linux Foundation's Open Model Initiative?

I feel like a lot of people just don't know there are Apache/CC-BY-NC licensed "AI" they can run on sane desktops, right now, that are incredible. I'm thinking of the most recent Command-R, specifically. I can run it on one GPU, and it blows expensive API models away, and it's mine to use.

And there are efforts to kill the power cost of inference and training with stuff like matrix-multiplication free models, open source and legally licensed datasets, cheap training... and OpenAI and such want to shut down all of this because it breaks their monopoly, where they can just outspend everyone scaling , stealiing data and destroying the planet. And it's actually a threat to them.

Again, I feel like corporate social media vs fediverse is a good anology, where one is kinda destroying the planet and the other, while still niche, problematic and a WIP, kills a lot of the downsides.

(page 2) 14 comments
sorted by: hot top controversial new old
[–] fruitycoder@sh.itjust.works 1 points 2 months ago (7 children)

Very much pro Open Source AI. Especially as a concept digital public good. With https://petals.dev/ being the most promising option that regard (imagine something like RAG for the arch wiki with very large models supported by the community!).

It feel very enthusiasts right now. Where I feel like I'm just on the cusp of having usable set up.

I personally really want a full Dev that just takes gitlab issues and runs codes against tests until it passes, and then cycles between attempting to explain what it doing and refactoring until that explanation is reasonably simple, then submit PR.

At the moment I am trying to use it as a copilot (ollama lama3, continue, and devonAI vscode plugins) all on my MacBook (my Linux machine were too small gpu wise, at least first time I attempted). That said it ok for questions no real luck on a decent experience for actually making anything.

The next step to me for it to move from enthusiast to hobbiest would be:

  1. Models that just work on my machine. I had to do a lot of trial and error just get performant models.
  2. Models just my use case. I don't know what model support tooling, or multimodal inputs. What models are actually optimized for programing, for actions (ala openinterpretor), for reviewing documents, etc.
  3. For federated (like pedals.dev) I feel like I need some sane data guardrails. I don't want my medical documents anywhere near "bittorrent style" anything, but would absolutely love to leverage it for better outcome on opensource projects without secrets file. This also feeds into point 2 to me.
  4. More sane RAG. Maybe even IPFS links to caches or DBs for popular data sources (like code docs for example).

I feel like there has to be a better way for this. Maybe its just selinux rules for data tags for locking down my local system and some routing config file at the root of my projects. Idk tbh

load more comments (7 replies)
[–] Lost_My_Mind@lemmy.world 0 points 2 months ago (4 children)
load more comments (4 replies)
load more comments
view more: ‹ prev next ›