Agentes Autônomos vs. CFM 2.454: A IA que Age e a Lei que Exige que o Médico Decida

↑ 🧭

IA Agêntica · Regulação · Prática Clínica

Agentes Autônomos vs. CFM 2.454: A IA que Age e a Lei que Exige que o Médico Decida

Em junho, um agente de IA superou clínicos experientes operando sozinho num prontuário. Dois meses depois entra em vigor a resolução do CFM que proíbe exatamente esse tipo de autonomia — a colisão é a fronteira onde a medicina brasileira vai se redesenhar.

Dr. Mbula Luzingu Barros | Médico Pediatra Intensivista · 25 anos de UTI Pediátrica | Consultor em IA na Saúde | Fundador da INOVAMED | Criador da Metodologia AIMED

📅 Publicado em 22 de junho de 2026

⏱ Vigência da CFM 2.454 em: 65 dias — 26 de agosto de 2026

Por Que Este Cruzamento Importa

No dia 17 de junho de 2026, a Nature publicou um trabalho que vale a pena ler antes que vire manchete distorcida. Ferber e colaboradores apresentaram a MIRA (Medical Intelligence for Reasoning and Action), um agente de IA operando dentro de um prontuário eletrônico em ambiente sandboxed. Não é um chatbot que sugere — é um agente que age: coleta história, solicita e interpreta exames de laboratório, imagem e microbiologia, formula diagnósticos diferenciais e propõe condutas, incluindo prescrever medicação, agendar cirurgia e planejar internação. Em casos retrospectivos reais, a MIRA superou clínicos experientes, aderindo a padrões de segurança e diretrizes.

Cinco dias depois eu escrevo isto, e a pergunta que importa não é “a IA vai substituir o médico?”. Essa pergunta é preguiçosa. A pergunta que importa para quem opera no chão do hospital é: o que acontece quando a tecnologia que age sozinha encontra uma legislação desenhada para impedir que ela aja sozinha? Essa legislação já existe no Brasil, e seu nome é Resolução CFM 2.454/2026.

O Que a MIRA Realmente Fez

A leitura honesta do artigo separa o sinal do ruído. A MIRA não é um oráculo. Os próprios autores são explícitos: o agente não substitui equipe humana especializada e não atingiu perfeição em todas as escolhas terapêuticas — a seleção de antibióticos específicos foi um dos pontos de falha. Eric Topol, comentando o achado, situou-o ao lado da AMIE como marco da “IA agêntica” em medicina, com a mesma ressalva: supervisão humana estrita e salvaguardas em nível de paciente continuam sendo a condição de existência dessas ferramentas.

Traduzindo para o beira-leito

A MIRA é o residente brilhante de primeiro ano que conhece a literatura inteira, acerta o raciocínio na maioria das vezes — e ainda assim você não deixa prescrever vancomicina sozinho às três da manhã. A diferença é que esse “residente” escala para milhares de leitos simultâneos. É justamente a escala que transforma um erro pontual de antibiótico em problema de saúde pública.

⚠ O que NÃO se deve concluir do paper

Superar clínicos em casos retrospectivos simulados não é o mesmo que segurança prospectiva à beira-leito. Casos retrospectivos têm desfecho conhecido e ruído controlado; o plantão real é ambíguo, dinâmico e tem custo assimétrico de erro. O dado é promissor — não é licença de deploy autônomo.

A Colisão: MIRA × CFM 2.454/2026

Em 11 de fevereiro de 2026 o Conselho Federal de Medicina publicou a Resolução nº 2.454/2026 (DOU de 27/02/2026), com vigência a partir de 26 de agosto de 2026 — 180 dias depois. Ou seja: quando a MIRA apareceu na Nature, o Brasil já tinha, no papel, a moldura jurídica que define o que dela é utilizável aqui. A tabela abaixo cruza, capacidade por capacidade, o que a MIRA faz com o que a resolução exige.

Capacidade da MIRA	O que a CFM 2.454 exige	Veredito
Coleta autônoma de história clínica	Apoio à decisão; transparência ao paciente sobre uso de IA	Compatível com supervisão e informação ao paciente
Solicita e interpreta exames	Julgamento crítico do médico sobre cada output	Compatível se houver revisão médica documentada
Gera diagnósticos diferenciais	Decisão final e responsabilidade integral humana	Condicional — vedada a comunicação autônoma
Prescreve, agenda cirurgia e interna autonomamente	Vedada a delegação de decisões terapêuticas sem mediação humana	Incompatível — risco alto a inaceitável

O veredito é nítido: a MIRA, do jeito que o paper a descreve, é incompatível com o deploy clínico brasileiro — não por mérito técnico, mas pelo grau de autonomia. O que sobrevive aqui é a MIRA rebaixada a copiloto, com o médico no laço em cada decisão de conduta.

Onde a MIRA Cairia na Classificação de Risco

Os Arts. 12 e 13 da resolução estabelecem quatro níveis de risco — baixo, médio, alto e inaceitável — e obrigam a instituição (não o fornecedor) a classificar e documentar o sistema. Os critérios incluem grau de autonomia do modelo, criticidade do contexto e nível de intervenção humana no resultado. Um agente que executa condutas sem revisão obrigatória é precisamente o que a norma desenha no topo da pirâmide.

⚠ Ponto crítico: a classificação é obrigação sua

Se você usa hoje qualquer sistema de apoio à decisão clínica sem ter feito a avaliação formal de risco, está operando fora da conformidade a partir de 26/08/2026 — independentemente do que o fornecedor afirme sobre o produto. A MIRA importada “as is” não seria um sistema de médio risco gerenciável: sem o rebaixamento de autonomia, ela é uso vedado.

A “Supervisão Significativa” é o Verdadeiro Trabalho

O conceito mais subestimado da resolução é o de supervisão significativa. Ele inverte o ônus da prova da segurança: não basta o fornecedor dizer “tem humano no laço”; é preciso provar que o humano tem condições reais de revisar — tempo, interface, contexto clínico e capacidade de auditar o que a IA propôs.

E aqui a evidência clínica recente joga contra a fantasia da automação total. A Nature Medicine mostrou em 2026 que LLMs generalistas de fronteira superaram ferramentas clínicas especializadas em conhecimento, alinhamento e queries do mundo real — o gargalo já não é a capacidade bruta do modelo. Em paralelo, estudos de raciocínio clínico mostram que esses mesmos modelos despencam em testes de concordância de script, que medem como o clínico atualiza a decisão diante de informação nova e ambígua. O modelo brilha na múltipla escolha e tropeça no raciocínio incremental do beira-leito — precisamente o terreno do intensivista.

O Buraco Pediátrico

O delta que poucos veem

A MIRA foi avaliada majoritariamente em populações adultas. A literatura é explícita sobre o buraco pediátrico: a variabilidade fisiológica por faixa etária, as diferenças de apresentação e a escassez de datasets de qualidade tornam a terapia intensiva pediátrica o último território a ser “automatizado” com segurança.

Para quem atua em UTI pediátrica e pós-operatório cardíaco, isso não é limitação acadêmica distante — é a garantia de que, por anos, o agente será no máximo um copiloto. E o piloto que sabe construir e auditar o copiloto será o profissional insubstituível.

A Linha do Tempo

Da publicação da norma ao agente autônomo

Por que a coincidência de calendário entre a MIRA e a CFM 2.454 não é fortuita.

11–27/02/2026

CFM 2.454/2026 publicada

Aprovada em 11/02; publicada no DOU em 27/02. Vacatio legis de 180 dias.

➜

17/06/2026 · Você está aqui

MIRA publicada na Nature

Agente autônomo supera clínicos em prontuário simulado — cinco dias antes deste artigo. Janela de adaptação ainda aberta.

➜

26/08/2026

Vigência plena da CFM 2.454

Todos os deveres tornam-se exigíveis ética e disciplinarmente. Agentes autônomos sem revisão médica passam a ser uso vedado.

➜

Após

Fiscalização dos CRMs

Conselhos Regionais iniciam supervisão ativa. A dupla conformidade CFM + ANVISA é o cenário real para sistemas de alto risco.

O Que Fazer na Segunda-Feira

Da teoria regulatória à conduta de plantão☐Migre da “IA informal” para a IA institucional. O ChatGPT pessoal no celular durante a visita morre com a 2.454. Sobrevive a IA corporativa, rastreável, com registro de quem propôs o quê e quem decidiu.
☐Desenhe a interface de supervisão, não só o modelo. A conformidade não está no algoritmo — está na tela em que o médico revisa. Tempo, contexto e trilha de auditoria são requisitos de produto.
☐Documente o diferencial humano. Onde o modelo erra (antibiótico específico, raciocínio incremental, pediatria) é onde o médico agrega valor mensurável. Registrar isso constrói a defesa ética e jurídica do seu serviço.
☐Classifique e documente o risco de cada sistema em uso. Antes de 26/08, com base nos critérios do Art. 12. Sem essa avaliação formal, você está fora da conformidade independentemente do fornecedor.

Entenda a IA Agêntica e a CFM 2.454 na Profundidade que Elas Merecem

A metodologia AIMED forma médicos que usam IA com rigor crítico, consciência regulatória e aplicação clínica real — incluindo governança e conformidade na prática, com framework de implementação e casos aplicados.

Conheça o AIMED →

Considerações Finais

A MIRA não é uma ameaça ao médico brasileiro. É um espelho. Ela mostra exatamente o que a máquina já faz melhor — e, por contraste, ilumina o que continua sendo nosso: a decisão sob incerteza, a responsabilidade pelo paciente concreto e a capacidade de auditar a própria ferramenta.

A Resolução 2.454 não freou o futuro; apenas exigiu que o futuro tenha um nome assinado embaixo. E esse nome, por lei e por mérito, continua sendo o do médico.

💡 Connecting the Dots: a coincidência de calendário é estratégica. A MIRA prova que a autonomia técnica chegou; a 2.454 garante que a responsabilidade jurídica não se automatiza junto. O valor profissional migra da execução clínica (que a máquina copia) para a governança da máquina — quem sabe classificar o risco, projetar a supervisão significativa e auditar o agente. A próxima vantagem competitiva do intensivista não é diagnosticar mais rápido que a IA; é ser a autoridade que a IA precisa para ter permissão de existir no hospital.

Referências

Ferber D, et al. Towards autonomous medical artificial intelligence agents. Nature. 2026 jun 17. Disponível em: https://www.nature.com/articles/s41586-026-10675-5
Conselho Federal de Medicina. Resolução CFM nº 2.454, de 11 de fevereiro de 2026. DOU 2026 fev 27; Ed. 39, Seção 1, p. 158. Disponível em: https://sistemas.cfm.org.br/normas/arquivos/resolucoes/BR/2026/2454_2026.pdf
Topol E. Agentic AI Comes to Medicine. Ground Truths. 2026. Disponível em: https://erictopol.substack.com/p/agentic-ai-comes-to-medicine
General-purpose large language models outperform specialized clinical AI tools on medical benchmarks. Nature Medicine. 2026. Disponível em: https://www.nature.com/articles/s41591-026-04431-5
Assessment of Large Language Models in Clinical Reasoning (Script Concordance Testing). NEJM AI. 2026. Disponível em: https://ai.nejm.org/doi/full/10.1056/AIdbp2500120
Assessing the Capability of LLMs in Answering Pediatric Critical Care Board-Style Questions. PMC/NCBI. 2025–26. Disponível em: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12637821/

Agentic AI · Regulation · Clinical Practice

Autonomous Agents vs. CFM 2.454: The AI That Acts and the Law That Demands the Physician Decide

In June, an AI agent outperformed experienced clinicians operating on its own inside an EHR. Two months later, Brazil’s medical regulation banning exactly that kind of autonomy takes effect — the collision is the frontier where Brazilian medicine will be redrawn.

Dr. Mbula Luzingu Barros | Pediatric Intensivist · 25 years in Pediatric ICU | AI Healthcare Consultant | Founder of INOVAMED | Creator of the AIMED Methodology

📅 Published June 22, 2026

⏱ CFM 2.454 enforcement in: 65 days — August 26, 2026

Why This Cross-Reading Matters

On June 17, 2026, Nature published a study worth reading before it becomes a distorted headline. Ferber et al. presented MIRA (Medical Intelligence for Reasoning and Action), an AI agent operating inside a sandboxed electronic health record. It is not a chatbot that suggests — it is an agent that acts: it takes histories, orders and interprets laboratory, imaging, and microbiology tests, generates differential diagnoses, and formulates management plans, including prescribing medication, scheduling surgery, and planning admissions. On real retrospective cases, MIRA outperformed experienced clinicians while adhering to safety standards and guidelines.

Five days later I am writing this, and the question that matters is not “will AI replace the physician?” That question is lazy. The question that matters for those who work on the hospital floor is: what happens when technology that acts on its own meets a law designed to prevent it from acting on its own? That law already exists in Brazil, and its name is CFM Resolution 2.454/2026.

What MIRA Actually Did

An honest reading separates signal from noise. MIRA is no oracle. The authors are explicit: the agent does not replace specialized human teams and did not achieve perfection in all therapeutic choices — specific antibiotic selection was one failure point. Eric Topol, commenting on the finding, placed it alongside AMIE as a landmark of “agentic AI” in medicine, with the same caveat: strict human supervision and patient-level safeguards remain the condition of existence for these tools.

Translating to the bedside

MIRA is the brilliant first-year resident who knows the entire literature, gets the reasoning right most of the time — and whom you still don’t let prescribe vancomycin alone at three in the morning. The difference is that this “resident” scales to thousands of beds at once. It is precisely the scale that turns an occasional antibiotic error into a public health problem.

⚠ What NOT to conclude from the paper

Outperforming clinicians on simulated retrospective cases is not the same as prospective bedside safety. Retrospective cases have known outcomes and controlled noise; the real shift is ambiguous, dynamic, and carries asymmetric error cost. The result is promising — not a license for autonomous deployment.

The Collision: MIRA × CFM 2.454/2026

On February 11, 2026, Brazil’s Federal Council of Medicine published Resolution 2.454/2026 (Official Gazette Feb 27, 2026), taking effect on August 26, 2026 — 180 days later. So when MIRA appeared in Nature, Brazil already had, on paper, the legal frame defining what of it is usable here. The table below cross-references, capability by capability, what MIRA does against what the resolution requires.

MIRA capability	What CFM 2.454 requires	Verdict
Autonomous history-taking	Decision support; patient transparency about AI use	Compatible with supervision and disclosure
Orders and interprets tests	Physician critical judgment over each output	Compatible with documented review
Generates differential diagnoses	Final decision and full human responsibility	Conditional — autonomous communication prohibited
Prescribes, schedules surgery, admits autonomously	Delegating therapeutic decisions without human mediation is prohibited	Incompatible — high-to-unacceptable risk

The verdict is clear: MIRA, as the paper describes it, is incompatible with clinical deployment in Brazil — not on technical merit, but on degree of autonomy. What survives here is MIRA demoted to a copilot, with the physician in the loop on every management decision.

Where MIRA Would Fall in the Risk Classification

Arts. 12 and 13 establish four risk levels — low, medium, high, and unacceptable — and require the institution (not the vendor) to classify and document the system. Criteria include the model’s autonomy, context criticality, and the level of human intervention in the output. An agent that executes management without mandatory review is precisely what the norm places at the top of the pyramid.

⚠ Critical point: classification is your responsibility

If you currently use any clinical decision support system without a formal risk assessment, you are out of compliance from August 26, 2026 — regardless of what the vendor claims about the product. MIRA imported “as is” would not be a manageable medium-risk system: without demoting its autonomy, it is prohibited use.

“Significant Supervision” Is the Real Work

The most underestimated concept in the resolution is significant supervision. It reverses the burden of proof for safety: it is not enough for the vendor to say “there’s a human in the loop”; it must be proven that the human has real conditions to review — time, interface, clinical context, and the ability to audit what the AI proposed.

And here recent clinical evidence works against the fantasy of total automation. Nature Medicine showed in 2026 that frontier general-purpose LLMs outperformed specialized clinical tools in knowledge, alignment, and real-world queries — the bottleneck is no longer raw model capability. In parallel, clinical reasoning studies show those same models collapse on script concordance testing, which measures how a clinician updates a decision in the face of new, ambiguous information. The model shines on multiple choice and stumbles on the incremental reasoning of the bedside — precisely the intensivist’s terrain.

The Pediatric Gap

The delta few see

MIRA was evaluated mostly in adult populations. The literature is explicit about the pediatric gap: physiological variability across age groups, differences in presentation, and the scarcity of high-quality datasets make pediatric critical care the last territory to be safely “automated.”

For those working in pediatric ICU and cardiac post-op, this is no distant academic limitation — it is the guarantee that, for years, the agent will at most be a copilot. And the pilot who knows how to build and audit the copilot will be the irreplaceable professional.

The Timeline

From the norm’s publication to the autonomous agent

Why the calendar coincidence between MIRA and CFM 2.454 is not accidental.

Feb 11–27, 2026

CFM 2.454/2026 published

Approved Feb 11; published in the Official Gazette Feb 27. 180-day vacatio legis.

➜

Jun 17, 2026 · You are here

MIRA published in Nature

Autonomous agent outperforms clinicians on simulated EHR — five days before this article. Adaptation window still open.

➜

Aug 26, 2026

CFM 2.454 in full force

All duties become enforceable under ethical and disciplinary law. Autonomous agents without medical review become prohibited use.

➜

Thereafter

State CRM enforcement

Regional Councils begin active oversight. Dual CFM + ANVISA compliance is the real scenario for high-risk systems.

What to Do on Monday

From regulatory theory to bedside conduct☐Migrate from “informal AI” to institutional AI. Personal ChatGPT on the phone during rounds dies with 2.454. What survives is corporate, traceable AI, with a record of who proposed what and who decided.
☐Design the supervision interface, not just the model. Compliance is not in the algorithm — it is in the screen where the physician reviews. Time, context, and audit trail are product requirements.
☐Document the human differential. Where the model errs (specific antibiotics, incremental reasoning, pediatrics) is where the physician adds measurable value. Recording this builds your service’s ethical and legal defense.
☐Classify and document the risk of every system in use. Before August 26, based on Art. 12 criteria. Without that formal assessment, you are out of compliance regardless of the vendor.

Understand Agentic AI and CFM 2.454 at the Depth They Deserve

The AIMED methodology develops physicians who use AI with critical rigor, regulatory awareness, and real clinical application — including governance and compliance in practice, with an implementation framework and applied cases.

Discover AIMED →

Final Considerations

MIRA is not a threat to the Brazilian physician. It is a mirror. It shows exactly what the machine already does better — and, by contrast, illuminates what remains ours: decision under uncertainty, responsibility for the concrete patient, and the ability to audit the tool itself.

Resolution 2.454 did not stop the future; it merely required that the future carry a signed name beneath it. And that name, by law and by merit, remains the physician’s.

💡 Connecting the Dots: the calendar coincidence is strategic. MIRA proves that technical autonomy has arrived; 2.454 guarantees that legal responsibility does not automate alongside it. Professional value migrates from clinical execution (which the machine copies) to governance of the machine — whoever can classify the risk, design significant supervision, and audit the agent. The intensivist’s next competitive edge is not diagnosing faster than the AI; it is being the authority the AI needs in order to be allowed to exist in the hospital.

References

Ferber D, et al. Towards autonomous medical artificial intelligence agents. Nature. 2026 Jun 17. Available at: https://www.nature.com/articles/s41586-026-10675-5
Federal Council of Medicine (CFM). Resolution CFM No. 2,454, of February 11, 2026. Official Gazette 2026 Feb 27; Ed. 39, Sec. 1, p. 158. Available at: https://sistemas.cfm.org.br/normas/arquivos/resolucoes/BR/2026/2454_2026.pdf
Topol E. Agentic AI Comes to Medicine. Ground Truths. 2026. Available at: https://erictopol.substack.com/p/agentic-ai-comes-to-medicine
General-purpose large language models outperform specialized clinical AI tools on medical benchmarks. Nature Medicine. 2026. Available at: https://www.nature.com/articles/s41591-026-04431-5
Assessment of Large Language Models in Clinical Reasoning (Script Concordance Testing). NEJM AI. 2026. Available at: https://ai.nejm.org/doi/full/10.1056/AIdbp2500120
Assessing the Capability of LLMs in Answering Pediatric Critical Care Board-Style Questions. PMC/NCBI. 2025–26. Available at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12637821/

Agentes Autônomos vs. CFM 2.454: A IA que Age e a Lei que Exige que o Médico Decida

Por Que Este Cruzamento Importa

O Que a MIRA Realmente Fez

Traduzindo para o beira-leito

⚠ O que NÃO se deve concluir do paper

A Colisão: MIRA × CFM 2.454/2026

Onde a MIRA Cairia na Classificação de Risco

⚠ Ponto crítico: a classificação é obrigação sua

A “Supervisão Significativa” é o Verdadeiro Trabalho

O Buraco Pediátrico

O delta que poucos veem

A Linha do Tempo

Da publicação da norma ao agente autônomo

O Que Fazer na Segunda-Feira

Da teoria regulatória à conduta de plantão

Entenda a IA Agêntica e a CFM 2.454 na Profundidade que Elas Merecem

Considerações Finais

Referências

Autonomous Agents vs. CFM 2.454: The AI That Acts and the Law That Demands the Physician Decide

Why This Cross-Reading Matters

What MIRA Actually Did

Translating to the bedside

⚠ What NOT to conclude from the paper

The Collision: MIRA × CFM 2.454/2026

Where MIRA Would Fall in the Risk Classification

⚠ Critical point: classification is your responsibility

“Significant Supervision” Is the Real Work

The Pediatric Gap

The delta few see

The Timeline

From the norm’s publication to the autonomous agent

What to Do on Monday

From regulatory theory to bedside conduct

Understand Agentic AI and CFM 2.454 at the Depth They Deserve

Final Considerations

References

DEIXE UM COMENTÁRIO Cancelar resposta