We all know there’s a lot of hype and skepticism around AI, and over the last year or so I’ve been hearing a lot about “Agentic” AI. I’ve struggled to get a real grasp on what that means without working examples; however, I’ve begun to see hints of something: videos mocking coders who scroll their phones while waiting for the AI to complete a task, peers claiming Claude but not GPT can do complex reasoning and planning. Not much, but enough for me to stop dismissing the term as pure buzzword.
Agentic AI is defined as “autonomous systems that act independently to achieve complex, multi-step goals without continuous human oversight.” This sounds fanciful, but my basic understanding is that these agentic systems do the large-scale reasoning and then use other apps to achieve smaller sub-goals. Essentially, they let you set up pipelines as verbal lists of tasks, then work through those tasks with some, perhaps limited, problem solving. A crucial aspect seems to be that the more tools you give the bot, the more it can do and the more failures it can handle. Sometimes “more tools” means a textbook or document about your work to help it reason and plan. Sometimes it means writing a script for it to use in future analyses.
Now, while these sound mildly interesting, they’re essentially useless if they’re locked behind a paywall. I’m not paying some company to think poorly for me. Someone else’s tools are not an extension of my skills or personal power, since I’d be neither able nor willing to build on them. However, the notion of Local Agentic AI changes this. If it’s on my computer, I can build on it even if I don’t fully understand what it’s doing. I can control it and treat it as an extension of myself – as humans do with all tools.
I’m a modest coder, and even basic AI has expanded my abilities there, just by helping me find algorithms I wouldn’t have known how to look for before. I have run local LLMs, but I’ve not tried these agentic LLMs. I worry I was unimpressed too quickly and gave up on a potentially useful tool. If I can tell the local agent to make a rough version of a function that does XXXX, then I can get more done. If I can tell it to write a simple script that makes a table I’d normally build by hand, check the script myself, then link that script to a command for a task I wouldn’t normally trust the AI with, then the AI can do a larger chunk of my work. The more scripts I make, the more the AI can do. The more scripts I download from open source communities, the more the AI can do. I don’t have to trust the AI if all it’s doing is moving information around and triggering scripts; I just have to check the scripts. If we start adding in robotics… yeah, I can see the hype.
Of course, the counter-argument is that we’ve had IFTTT triggers and pipelines for decades. So maybe this isn’t fundamentally new, but is it still an impetus to download more tools and build more pipelines? Will I fall behind if I don’t figure out how to use this efficiently and effectively (FOMO)? Does anyone here have experience with Agentic LLMs (especially local)? Also, what’s the best Lemmy community for learning more about this sort of thing and maybe also hooking it up to basic robots?


My limited understanding of “agentic AI” is that it’s an LLM prompted repeatedly by a script in a loop. You give it an initial prompt, and the LLM spits out a response indicating what it thinks it should do next (like run a program). The script looks at the LLM’s output, does what is asked if it can (or tells the LLM that it can’t/won’t do that, if so programmed), then feeds the result back in as the next prompt. This repeats until the LLM indicates in one of its responses that the complex task is done – or indefinitely, if the prompt is the sort that will never terminate.
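That loop can be sketched in a few lines. This is just an illustration, not any particular framework: `call_llm` is a stand-in for a real local model call, and the `RUN:`/`DONE:` convention is something I made up for the example.

```python
import subprocess

def call_llm(prompt: str) -> str:
    """Stand-in for a real local model call (e.g. an HTTP request to a
    local server). Stubbed here so the loop can run end to end."""
    if "2 + 2" in prompt:
        return "RUN: echo 4"
    return "DONE: the answer is 4"

def agent_loop(task: str, max_steps: int = 10) -> str:
    """Feed the task to the model, execute what it asks for, and loop
    the results back in until it declares the task done."""
    prompt = task
    for _ in range(max_steps):
        reply = call_llm(prompt).strip()
        if reply.startswith("DONE:"):
            return reply[len("DONE:"):].strip()
        if reply.startswith("RUN:"):
            cmd = reply[len("RUN:"):].strip()
            result = subprocess.run(cmd, shell=True, capture_output=True, text=True)
            prompt = f"Command output:\n{result.stdout}"
        else:
            # The script can refuse and tell the model how to behave.
            prompt = "I can't do that. Reply with RUN: <cmd> or DONE: <answer>."
    return "step limit reached"

print(agent_loop("What is 2 + 2?"))  # -> the answer is 4
```

The real versions mostly differ in how much plumbing surrounds this loop (conversation history, tool schemas, error handling), not in the basic shape.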
You can limit what the overall system is capable of by careful design of the control script – just using LLMs to bridge the gaps for tasks that are hard/impossible to program traditionally. For controlling a robot, I’d guess that you’d want to have actions for things like taking a picture with a camera (or reading other sensors) that the LLM can analyze, repositioning arms, controlling motors, etc?
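The “careful design of the control script” part could be as simple as a dispatch table of allowed actions; anything the model asks for outside that table is refused. The action names here are made up for illustration:

```python
def take_picture() -> str:
    return "image captured (stub)"

def move_arm(position: str) -> str:
    return f"arm moved to {position} (stub)"

# Only actions in this table are ever executed, regardless of what the
# model's output asks for.
ALLOWED_ACTIONS = {
    "take_picture": lambda arg: take_picture(),
    "move_arm": lambda arg: move_arm(arg),
}

def dispatch(request: str) -> str:
    """Parse a model request like 'move_arm: left' and run it only if
    it names an allowed action."""
    name, _, arg = request.partition(":")
    action = ALLOWED_ACTIONS.get(name.strip())
    if action is None:
        return f"refused: '{name.strip()}' is not an allowed action"
    return action(arg.strip())

print(dispatch("move_arm: left"))  # arm moved to left (stub)
print(dispatch("rm -rf /"))        # refused: ...
```

The LLM never touches the hardware directly; it only emits text, and the script decides what that text is allowed to cause.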
If I knew how to set up a robotic arm that could safely turn the pages of a traditional book, then combining it with a camera or flatbed scanner, a couple different vision models + traditional OCR (to sanity check each other), and wiring the whole thing up with a custom script to make really high-quality ebooks out of the books I own that no one has bothered to digitize would be neat… Maybe some day.
You might get more informed responses on db0 or another site; most of Lemmy is straight up AI-hostile. I have mixed feelings on it personally – I really don’t like the AI companies DDOSing the internet for profit while driving up the cost of electronics (etc), but writing off local AI tech entirely also seems like throwing out the baby with the bathwater to me…
Yeah, I was thinking about the code too. I think the looping output explanation makes a lot of sense and puts the “Agentic AI” into a healthier/more-realistic framework.
I’m a lot more inclined to write my own loops than trust someone else’s AI, but with that framework I’m not sure how useful these “AI agents” will be for most non-text-based problems, since that would require converting back and forth between text-based mediums and whatever medium the problem is in, which seems very problematic. For code, I could try giving prompts to catch typos, make tests, and improve functions. Even this seems pretty limited, since usually the AI can’t see the larger picture, identify the problem, and plan a solution on its own. Or maybe it can in some contexts, but not the stuff I’m working on – maybe my work isn’t routine enough, Idk. I have been using it to find and learn algorithms and get numpy notation, but it just doesn’t grasp what math needs to be done when I try to explain my problems.
I’ll have to think more on how to set up loops that are more generally useful and won’t require more work in making sure they’re doing what I want than the work they actually get done.
I don’t know enough about the specifics of what you’re doing to really give detailed suggestions, but I have been surprised by models like qwen3.5/3.6 giving reasonable results to questions about images – like answering questions based on comparing a few images. For example:
Those prompts actually worked for imagery from camera systems I have to deal with for my job – which, frankly, shocked me. The AM/PM one in particular read timestamps in the image (i.e. did OCR automatically) and converted from a 24 hour clock to answer the question. I’ve implemented that in scripts with older computer vision tools (like tesseract) that needed a lot of hand holding; the fact that current models can just do it is sort of mind blowing to me…
I think it’s very much a “how much data on this exists” sort of problem for most of these. Like, I can pick out bacteria from fungi on an agar plate trivially, but I don’t know if there are databases of agar plates characterizing different growths with different background colors and all the diversity that real life has. Honestly, I haven’t tried this yet. It might get it just fine, or might get it if I backlight the plate – of course, at that stage there are other programs for detecting colonies.
The dream, for me, is to get it to understand the protein structure files and DNA sequence files then hook it up to some lab robotics and automate experiments that are mostly trivial but slightly dynamic. Maybe start with something simple like cloning then build out to other methods. Some of this stuff exists already but companies charge you a fortune and go out of business (or get bought up and discontinued) constantly, so it kinda needs to be stuff I can build and maintain myself – or FOSS.
Even for purely computer stuff, any time I try to get the AI to help with my proteins, it’s functionally useless because it doesn’t have a way to “see” the protein’s structure file. I can write my own scripts to help with that, but I’ll have to work on the connection between the language the AI thinks in and the actual things my code detects. Or maybe I can tell it to ask questions based on the writing, then run code that analyzes the protein to answer those specific questions… Even then, much of what I’d want help with in looking at proteins is writing analyses of points in 3D space, and while it has helped me pick the right algorithms (sometimes), I haven’t really been able to give it enough information to let it check that things are being implemented correctly (I think this is alignment). That might mean something like hooking it up to pymol (a 3D viewer), or it might just be a bit too dumb. It’s hard to say without trying it, and it’s a lot of work for something that’s likely to get confused even with the ability to “look” at the protein structure.
I feel like, for coding, one thing I’m going to have to get it to do is stop after it makes a function or something, so I can check that it’s still going where it’s supposed to go or tell it what the next function needs to do. I don’t know. Maybe I’ll start with lots of hand-holding, then slowly build it up until it can reliably do more – or until I can’t get it to be reliable enough. Maybe there’s a coding community on Lemmy that’s a decent place to talk shop on how to build these scripts up and what local models are good at what?
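That stop-and-check idea is easy to express as a loop with a review step after each generated piece. This is only a sketch of the control flow; `generate` and `approve` are placeholders for a model call and a human (or automated) check:

```python
def checkpointed_build(tasks, generate, approve):
    """Ask for one function at a time and pause for review after each;
    stop the run as soon as a draft is rejected so a human can redirect it."""
    accepted = []
    for task in tasks:
        draft = generate(task)   # stand-in for a model call
        if not approve(draft):   # stand-in for 'I read it and said no'
            break
        accepted.append(draft)
    return accepted

# Stubs standing in for the model and the reviewer:
drafts = checkpointed_build(
    ["parse input", "write output", "delete everything"],
    generate=lambda t: f"def step():  # {t}",
    approve=lambda d: "delete" not in d,
)
print(drafts)  # the third draft is rejected, so only the first two are kept
```

The hand-holding level then becomes a dial: `approve` can start as a manual prompt for every draft and gradually become automated tests for the parts that have earned trust.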
I tried throwing this image at qwen3.6 locally with the prompt “What is in this image?” and it spat out:
So you might be able to get further than you expect. You might try collecting some of your own sample data for things you understand (like particular bacteria you’re interested in) and providing those images along with a prompt that includes labels for your sample imagery. (Like “Image 1 is an example of (…whatever…). Image 2 is an example of (…something else…). Which (if any) of these are in Image 3?”)
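Assembling that kind of labeled few-shot prompt is easy to script so it stays consistent across samples. How the images themselves get attached depends on the model runner you use; this sketch only builds the text part, and the file names are made up:

```python
def build_comparison_prompt(labeled: dict, query_image: str) -> str:
    """Build a few-shot image-comparison prompt from labeled example
    images plus one unlabeled sample to classify."""
    lines = []
    for i, (path, label) in enumerate(labeled.items(), start=1):
        lines.append(f"Image {i} ({path}) is an example of {label}.")
    lines.append(f"Image {len(labeled) + 1} ({query_image}) is the sample to classify.")
    lines.append("Which (if any) of the labeled examples does the sample match?")
    return "\n".join(lines)

print(build_comparison_prompt(
    {"ecoli.jpg": "E. coli colonies", "mold.jpg": "fungal growth"},
    "plate_today.jpg",
))
```

Keeping the prompt generation in a script also means the same wording gets reused every run, which makes it easier to tell whether a failure was the model or the prompt.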
I don’t know how well it would work, but worth a try.
I tried throwing this image at qwen3.6 as well, and it spits this out (but takes a REALLY long time to do it on my local hardware):
You might be able to get it to do something useful with analysis of protein renderings, but I think you’ll need some creativity with the prompts to stop it from overthinking things… It was able to recognize this as myoglobin in the “thinking” stage quickly enough, but then kept second-guessing itself about various aspects for a long time. I also tried throwing it at gemma4, but it didn’t really give me much besides recognizing it as some sort of protein structure.
You might find this project and associated HN post interesting – it’s basically someone doing that with electronics analysis instead of biology, IIUC. No idea how well it works, but interesting to look at and think about.
There’s !localllama@sh.itjust.works and probably some communities on lemmy.dbzer0.com as well – they’re not super-active though (at least from what’s made it to my attention browsing by all).
Yeah, I think this conversation is entering just-try-to-build-it territory, but it’s definitely worth a try.
Thanks. I’ll have to get through my current project, but this will be a good resource when I do more on LLMs.