Get Inspired
Educate yourself
Start the journey
Collaborate
The guidelines give you, as someone working in public administration, guidance on using generative AI in your organization. In Get started with the guidelines, we walk you through how to approach the guidelines and your work with generative AI. |
Discover how AI applications can achieve GDPR compliance. This guide from AIUC covers data protection principles, security measures, automated consent management, and future guidelines for the interplay between AI and GDPR. Read more in the article AI and GDPR compliance: How to protect data in AI applications |
Today we introduced a research preview of Operator, an agent that can go to the web to perform tasks for you. Powering Operator is Computer-Using Agent (CUA), a model that combines GPT-4o's vision capabilities with advanced reasoning through reinforcement learning. CUA is trained to interact with graphical user interfaces (GUIs), the buttons, menus, and text fields people see on a screen, just as humans do. This gives it the flexibility to perform digital tasks without using OS- or web-specific APIs. CUA builds off of years of foundational research at the intersection of multimodal understanding and reasoning. By combining advanced GUI perception with structured problem-solving, it can break tasks into multi-step plans and adaptively self-correct when challenges arise. This capability marks the next step in AI development, allowing models to use the same tools humans rely on daily and opening the door to a vast range of new applications. While CUA is still early and has limitations, it sets new state-of-the-art benchmark results, achieving a 38.1% success rate on OSWorld for full computer use tasks, and 58.1% on WebArena and 87% on WebVoyager for web-based tasks. These results highlight CUA’s ability to navigate and operate across diverse environments using a single general action space. We’ve developed CUA with safety as a top priority to address the challenges posed by an agent having access to the digital world, as detailed in our Operator System Card. In line with our iterative deployment strategy, we are releasing CUA through a research preview of Operator at operator.chatgpt.com for Pro Tier users in the U.S. to start. By gathering real-world feedback, we can refine safety measures and continuously improve as we prepare for a future with increasing use of digital agents. |
I’m attending Techarena 2025! Are you joining Scandinavia’s biggest tech and business event? Don’t miss out! Register today! #Techarena2025 |
This is not a cooked demo. This is a real AI Agent that solves Github issues autonomously with remarkable efficiency. You should block a few minutes to watch… |
Project goals
Project results will be published here over time. Project manager: adilhan.adil@ccgeurope.com |
On (some of) the latest export controls. With so many rules, one thing is nearly certain: Sooner rather than later, we will need yet more rules to fix the problems with these rules.
A comprehensive overview of US export regulations on AI hardware. The analysis covers recent regulatory changes (which are complex and likely to evolve alongside AI technology) while placing them in a historical context. |
2025-01-26 06:00
MarkTechPost
Advancements in multimodal intelligence depend on processing and understanding images and videos. Images can reveal static scenes by providing information regarding details such as objects, text, and spatial relationships. However, this comes at the cost of being extremely challenging. Video comprehension involves tracking changes over time, among other operations, while ensuring consistency across frames, requiring […] The post Alibaba Researchers Propose VideoLLaMA 3: An Advanced Multimodal Foundation Model for Image and Video Understanding appeared first on MarkTechPost. |
2025-01-26 03:51
MarkTechPost
The artificial intelligence (AI) landscape is evolving rapidly, but this growth is accompanied by significant challenges. High costs of developing and deploying large-scale AI models and the difficulty of achieving reliable reasoning capabilities are central issues. Models like OpenAI’s GPT-4 and Anthropic’s Claude have pushed the boundaries of AI, but their resource-intensive architectures often make […] The post ByteDance AI Introduces Doubao-1.5-Pro Language Model with a ‘Deep Thinking’ Mode and Matches GPT 4o and Claude 3.5 Sonnet Benchmarks at 50x Cheaper appeared first on MarkTechPost. |
2025-01-26 02:07
MarkTechPost
AI has entered an era of competitive and groundbreaking large language models and multimodal models. The development has two sides: open-source models and proprietary models. DeepSeek-R1, an open-source AI model developed by DeepSeek-AI, a Chinese research company, exemplifies this trend. Its emergence has challenged the dominance of […] The post DeepSeek-R1 vs. OpenAI’s o1: A New Step in Open Source and Proprietary Models appeared first on MarkTechPost. |
2025-01-26 01:57
MarkTechPost
As large language models (LLMs) continue to evolve, understanding their ability to reflect on and articulate their learned behaviors has become an important aspect of research. Such capabilities, if harnessed, can contribute to more transparent and safer AI systems, enabling users to understand the models’ decision-making processes and potential vulnerabilities. One of the biggest challenges […] The post This AI Paper Explores Behavioral Self-Awareness in LLMs: Advancing Transparency and AI Safety Through Implicit Behavior Articulation appeared first on MarkTechPost. |
2025-01-25 17:07
MarkTechPost
As the adoption of generative AI continues to expand, developers face mounting challenges in building and deploying robust applications. The complexity of managing diverse infrastructure, ensuring compliance and safety, and maintaining flexibility in provider choices has created a pressing need for unified solutions. Traditional approaches often involve tight coupling with specific platforms, significant rework during […] The post Meta AI Releases the First Stable Version of Llama Stack: A Unified Platform Transforming Generative AI Development with Backward Compatibility, Safety, and Seamless Multi-Environment Deployment appeared first on MarkTechPost. |
2025-01-25 13:00
Wired
You might assume that tech-savvy people are the most open to using AI, but research suggests it's actually those who are least familiar with it. |
2025-01-21 17:59
ScienceDaily
Scientists have developed a computing chip that can learn, correct errors, and process AI tasks. |
Modulai has, in collaboration with Lindex, developed and deployed a custom-built recommender system. Lindex is dedicated to offering its customers a relevant and transparent personalized experience in a multi-channel context. To deliver on that, a robust recommender system is considered a vital cornerstone.
Engineering students looking for a real-world challenge
2024-05-01
-
2025-06-06
|
SkillStation is looking for young people aged 18-25
2024-11-06
-
2025-05-31
|