• Marvel Rivals: Chrono Shield Explained
    gamerant.com
    Marvel Rivals' competitive mode brings an interesting twist to the ranking system with the introduction of the Chrono Shield, and it can be quite tricky to figure out what its purpose is at first. For players looking to know more about it, here is what they need to know.
  • Things Squid Game Season 2 Does Better Than Season 1
    gamerant.com
    Squid Game Season 2 takes things to a whole new level, bringing fans an exhilarating follow-up to the global phenomenon that took the world by storm. This season has a lot more action, better villains, deeper character arcs, and more when compared to its previous installment. Though fans are debating over which season is better, there's no denying that they all appreciate the screen time that the Front Man got this time, along with the determination of the players to fight back.
  • One Piece: Big Mom's Massive Elbaf Role, Explained
    gamerant.com
    Big Mom is one of the most important characters in One Piece, and someone that fans absolutely relished as a villain in the post-time skip of the series. Even though Big Mom turned out to be the secondary antagonist of the Wano Country Arc, she was still absolutely phenomenal to watch. It is a shame that Big Mom was not dealt with properly by Oda and essentially rushed off the screen simply because he wanted to put more focus on Kaido, due to Wano being his arc.
  • Game design strategy
    gamedev.net
    I started out with the idea that all the chess games out there are missing fun. The game itself is rewarding and also healthy, if we consider the sneaky tactics used nowadays to manipulate our dopamine levels. So I have created a game that is rewarding, where the player feels progression and won't feel stuck. And won't feel stupid, so he won't churn from learning to doomscrolling. I am looking for beta testers to try it and tell me what they think and what they would change.
  • Oris ProPilot X Calibre 115 Year of the Snake Limited Edition Watch Flaunts Chinese Cyan Details
    www.yankodesign.com
    There are certain timeless watches that display fine craftsmanship and draw power from iconic in-house movements: the Oris ProPilot X Calibre 115 is one of them. The contemporary Oris watch, with its titanium case and bracelet and a 10-day non-linear power reserve indicator on a fully skeletonized dial, is a fine example of independent Swiss mechanical watchmaking, and it has now received a zodiac makeover to celebrate the Chinese New Year.
    The special edition Oris ProPilot X Year of the Snake watch is limited to only 88 pieces and has a very attractive green hue inspired by the Wood Snake, the sign for 2025 in the Chinese zodiac calendar. This is the second Year of the Snake edition watch we have discussed in as many days: just yesterday we covered the red-hot dialed Longines Conquest Heritage with an artistic caseback.
    Designer: Oris
    This special version of one of Oris's most innovative watches has been decorated with cyan detailing, inspired by the color linked with the Chinese zodiac element of wood. As a more interesting detail, the signature 10-day power reserve at 3 o'clock has been given a golden snake-tongue seconds hand that stands out against the symbolic green skeletonized dial.
    Everything else, including the innards of the watch, remains the same as in the base model. Constructed from multi-piece titanium, the watch features a 44mm case and sapphire glass with double-sided anti-reflective coating. The dial reveals large Super-LumiNova-filled hands for hours and minutes, with subsidiary seconds at 7:30.
    The watch runs on the skeletonized Calibre 115 in-house movement, which is hand-wound and has a large 10-day power reserve indicator on the dial showing how much time you have before the watch needs winding. The Oris ProPilot X Year of the Snake edition comes on a titanium strap, inside a special carry box.
    Priced at $9,100, it is limited to only 88 pieces; if you are interested, you may want to hurry before stock runs out.
  • Massager brings comfort and de-stressing to your abdomen
    www.yankodesign.com
    When it's that time of the month or when I'm just so stressed (which can also be both at the same time), one of my constant wishes is to get a massage. But of course, it's not the best time to have one, especially if you're experiencing dysmenorrhea. A comfortable, relaxing massage would be pretty helpful during those times, even if it's not an actual, proper massage.
    Designer: Bebop Design
    The AM1 massager is an abdominal massager that is meant to be comfortable rather than just functional. In fact, it was designed to be part of the furniture, in the way you would leave a cushion lying around on your couch or on your bed. And when you're using it, it is meant to feel like you're hugging a cushion to your tummy while being massaged at the same time. It can also look like a VR headset, except that it wraps around your abdomen rather than your head. It follows the line of the body with a smooth silhouette. It has fabric on the front, which keeps things comfortable even as the vibration of the massage feature is shaking you up.
    There isn't any other information about this massager, such as its features or other specifications. But as it is, and because of my aforementioned issues during certain times of the month, it's something I might want to own someday, especially if it is as comfortable and de-stressing as it promises.
  • Phantom data centers: What they are (or aren't) and why they're hampering the true promise of AI
    venturebeat.com
    So-called fake data centers are a bottleneck to scaling infrastructure to keep up with compute demand for AI and other critical workloads.
  • FutureHouse Researchers Propose Aviary: An Extensible Open-Source Gymnasium for Language Agents
    www.marktechpost.com
    Artificial intelligence (AI) has made significant strides in developing language models capable of solving complex problems. However, applying these models to real-world scientific challenges remains difficult. Many AI agents struggle with tasks requiring multiple cycles of observation, reasoning, and action. Moreover, existing models often lack the ability to integrate tools effectively or maintain consistency in multi-step reasoning. These issues are particularly pressing in scientific domains, where tasks demand precision, adaptability, and computational efficiency. Addressing these problems requires a flexible and practical framework for training and deploying language agents.
    Introducing Aviary: An Extensible Open-Source Gymnasium
    A team of researchers from FutureHouse Inc., the University of Rochester, and the Francis Crick Institute has introduced Aviary, an open-source gymnasium for language agents. Aviary addresses the limitations of existing frameworks by introducing language decision processes (LDPs), which model tasks as partially observable Markov decision processes grounded in natural language. This approach enables language agents to effectively handle complex, multi-step reasoning tasks.
    Aviary includes five environments, three of which are designed for advanced scientific tasks:
      • Molecular Cloning: Manipulating DNA constructs using tools for sequence annotation and protocol planning.
      • Scientific Literature QA: Retrieving and analyzing scientific literature to answer detailed research questions.
      • Protein Stability Engineering: Proposing protein mutations to improve stability with the help of computational and biochemical tools.
    These tasks make Aviary a valuable platform for training and evaluating language agents in real-world scenarios requiring reasoning, tool integration, and iterative learning.
    Technical Insights and Benefits of Aviary
    Aviary uses a stochastic computation graph framework to model language agents, enabling flexible and efficient optimization. Key features include:
      • Expert Iteration (EI): A training method that iteratively refines agents using high-quality trajectories.
      • Majority Voting: A technique to improve accuracy by combining multiple inference outputs without excessive computational overhead.
      • Tool Integration: Built-in support for tools like sequence annotators and literature retrieval systems, enhancing real-world applicability.
    The researchers show that non-frontier, open-source models like Llama-3.1-8B-Instruct can achieve performance comparable to or better than frontier models (e.g., Claude 3.5 Sonnet) in these environments. Additionally, these models operate at significantly lower inference costs, making them accessible for large-scale scientific applications.
    Results and Insights
    Aviary-trained agents demonstrate impressive performance:
      • On molecular cloning tasks, the Llama-3.1-8B-Instruct agent showed notable accuracy improvements through EI and behavior cloning, outperforming human experts on SeqQA benchmarks.
      • In scientific literature QA tasks, the same model achieved performance levels on par with or better than humans, while maintaining efficiency.
      • Majority voting further enhanced accuracy, with SeqQA results reaching 89% after sampling multiple trajectories, surpassing human and frontier model benchmarks.
    Conclusion
    Aviary represents a thoughtful advancement in the development of language AI agents. By demonstrating that open-source, non-frontier models can excel in scientific tasks, Aviary opens new possibilities for accessible and cost-effective AI research. Its open-source design encourages collaboration, enabling researchers and developers to refine and extend its applications further.
    With tools and training methods tailored for real-world challenges, Aviary sets a benchmark for how language agents can address complex tasks. It provides a compelling framework for advancing AI-driven scientific exploration and practical problem-solving.
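    The majority-voting idea described above can be sketched in a few lines. This is only an illustration under stated assumptions, not Aviary's own code: `majority_vote`, `vote_over_rollouts`, and the `sample_answer` callable are hypothetical names standing in for "run the agent once and read off its final answer".

```python
from collections import Counter
from typing import Callable, Hashable, Sequence


def majority_vote(answers: Sequence[Hashable]) -> Hashable:
    """Return the most frequent answer; ties go to the answer seen first."""
    if not answers:
        raise ValueError("need at least one sampled answer")
    # Counter.most_common orders equal counts by first occurrence.
    return Counter(answers).most_common(1)[0][0]


def vote_over_rollouts(
    sample_answer: Callable[[str], Hashable],  # hypothetical: one agent rollout
    question: str,
    n_samples: int = 5,
) -> Hashable:
    """Sample several independent rollouts and keep the consensus answer."""
    answers = [sample_answer(question) for _ in range(n_samples)]
    return majority_vote(answers)
```

    The cost grows linearly with the number of sampled rollouts, so the technique trades extra inference for accuracy rather than requiring any retraining.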
  • This AI Paper Introduces SWE-Gym: A Comprehensive Training Environment for Real-World Software Engineering Agents
    www.marktechpost.com
    Software engineering agents have become essential for managing complex coding tasks, particularly in large repositories. These agents employ advanced language models to interpret natural language descriptions, analyze codebases, and implement modifications. Their applications include debugging, feature development, and optimization. The effectiveness of these systems relies on their ability to handle real-world challenges, such as interacting with extensive repositories and executing tests to validate solutions, making the development of such agents both exciting and challenging.
    The lack of comprehensive training environments is one of the primary challenges in this domain. Many existing datasets and benchmarks, such as SWE-Bench and R2E, either focus on isolated problems or rely on synthetic instructions that do not represent the complexities of real-world coding tasks. For instance, while SWE-Bench offers test cases for validation, its training dataset lacks executable environments and dependency configurations. This discrepancy limits the utility of existing benchmarks for training agents capable of addressing the nuanced challenges of software engineering.
    Efforts to address these challenges have previously relied on tools like HumanEval and APPS, which support isolated task evaluation but fail to integrate repository-level complexities. These tools often lack a coherent link between natural language problem descriptions, executable codebases, and comprehensive testing frameworks. As a result, there is a pressing need for a platform that can bridge these gaps by offering real-world tasks within functional and executable environments.
    Researchers from UC Berkeley, UIUC, CMU, and Apple have developed SWE-Gym, a novel environment tailored for training software engineering agents. SWE-Gym integrates 2,438 Python tasks sourced from GitHub issues across 11 repositories, offering pre-configured executable environments and expert-validated test cases. This platform combines real-world task complexity with automated testing mechanisms, creating a more effective training ecosystem for language models.
    SWE-Gym's methodology focuses on replicating real-world coding conditions. The tasks are derived from GitHub issues and paired with the corresponding repository snapshots and unit tests. Dependencies for each task are meticulously configured, ensuring the accuracy of the executable environment. These configurations were semi-manually validated through rigorous processes involving around 200 human annotation hours and 10,000 CPU core hours, resulting in a robust training dataset. The researchers also introduced a subset of 230 tasks, SWE-Gym Lite, which targets simpler and self-contained problems, enabling rapid prototyping and evaluation.
    The performance evaluation of SWE-Gym demonstrated its significant impact on training software engineering agents. Using the Qwen-2.5 Coder model, fine-tuned agents achieved marked improvements in resolving tasks on SWE-Bench benchmarks. Specifically, resolve rates increased from 20.6% to 32.0% on SWE-Bench Verified and from 15.3% to 26.0% on SWE-Bench Lite. These gains represent a significant leap over previous benchmarks for open-weight language models. Furthermore, SWE-Gym-supported agents reduced failure rates in stuck-in-loop scenarios by 18.6% and improved task completion rates in real-world settings.
    The researchers also explored inference-time scaling by employing a verifier trained on agent trajectories sampled from SWE-Gym. This approach allowed agents to generate multiple solution trajectories for a given problem, selecting the most promising one using a reward model. The verifier achieved a Best@K score of 32.0% on SWE-Bench Verified, demonstrating the environment's capacity for improving agent performance through scalable compute strategies. These results emphasize the potential of SWE-Gym to enhance both the development and evaluation of software engineering agents.
    SWE-Gym is a pivotal tool in advancing research on software engineering agents. By addressing the limitations of prior benchmarks and offering a scalable, realistic environment, it equips researchers with the resources needed to develop robust models capable of solving complex software challenges. With its open-source release, SWE-Gym paves the way for significant advancements in the field, setting new standards for the training and evaluation of software engineering agents.
    Nikhil is an intern consultant at Marktechpost. He is pursuing an integrated dual degree in Materials at the Indian Institute of Technology, Kharagpur. Nikhil is an AI/ML enthusiast who is always researching applications in fields like biomaterials and biomedical science. With a strong background in Material Science, he is exploring new advancements and creating opportunities to contribute.
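    The verifier-based selection described above (sample several trajectories, keep the one a reward model scores highest) can be sketched as follows. This is a minimal illustration, not SWE-Gym's implementation: the `Trajectory` type, `select_best`, `best_at_k`, and the `score` callable are all hypothetical, and `score` stands in for a trained verifier.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Trajectory:
    patch: str      # candidate fix produced by one agent rollout
    resolved: bool  # whether the task's tests pass (known only at eval time)


def select_best(
    candidates: Sequence[Trajectory],
    score: Callable[[Trajectory], float],  # hypothetical verifier / reward model
) -> Trajectory:
    """Pick the candidate the verifier scores highest (first one wins ties)."""
    if not candidates:
        raise ValueError("need at least one candidate trajectory")
    return max(candidates, key=score)


def best_at_k(
    problems: Sequence[Sequence[Trajectory]],
    score: Callable[[Trajectory], float],
    k: int,
) -> float:
    """Fraction of problems whose verifier-selected candidate, chosen from
    the first k samples, actually resolves the task."""
    hits = sum(select_best(cands[:k], score).resolved for cands in problems)
    return hits / len(problems)
```

    Note that the verifier never sees `resolved`; that field is used only to grade the selection afterwards, which is why a better verifier translates directly into a higher score at the same sampling budget.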
  • Daily Deals: The Legend of Heroes: Trails Through Daybreak, EPOMAKER Shadow-X Keyboard, Samsung 98" TV, and More
    www.ign.com
    The first weekend of 2025 is here, which makes today a great time to check out the latest deals! Here are the best deals for Saturday, January 4.
    The Legend of Heroes: Trails Through Daybreak for $41.99
    The Legend of Heroes: Trails through Daybreak: Deluxe Edition - Nintendo Switch
    The Legend of Heroes: Trails Through Daybreak is on sale this weekend at Amazon for $41.99. This entry in the long-running Trails series is a solid place to start, especially with Daybreak II due out next month. If you're searching for your next RPG, this is a great option!
    Marvel vs. Capcom Fighting Collection: Arcade Classics for $34
    Marvel vs. Capcom Fighting Collection: Arcade Classics - PlayStation 4
    Marvel vs. Capcom Fighting Collection: Arcade Classics - Nintendo Switch
    You can score Marvel vs. Capcom Fighting Collection: Arcade Classics for only $34 today at Amazon. This collection packs in seven different titles, including the beloved Marvel vs. Capcom 2: New Age of Heroes. At last, you can play these classic titles on modern platforms.
    EPOMAKER Shadow-X Gasket Mechanical Keyboard for $42.99
    EPOMAKER Shadow-X Gasket Mechanical Keyboard with Screen
    This EPOMAKER Shadow-X keyboard is perfect for switching up your setup with some color. The keyboard can be used either wirelessly or wired, with a 3,000mAh battery to support hours of use. There's even a color screen that displays settings, specs, and more at a glance.
    Belkin MagSafe 3-in-1 Charger Stand for $65.99
    Belkin MagSafe Charger, 3-in-1 Wireless Charging Stand
    If you own an iPhone, Apple Watch, and a pair of Apple AirPods, this is the ultimate accessory for you. The Belkin MagSafe 3-in-1 Charger Stand can charge all of your devices wirelessly from a single stand. It's perfect for placing on your nightstand, or even for bringing with you on a trip away from home. Say goodbye to the days of one cord per device.
    Dragon Quest Illustrations: 30th Anniversary Edition for $23.82
    Dragon Quest Illustrations: 30th Anniversary Edition
    Featuring 240 pages of artwork from Akira Toriyama, Dragon Quest Illustrations: 30th Anniversary Edition is the ultimate gift for any fan of the iconic RPG series. This book features over 500 different illustrations from Toriyama, stretching from Dragon Quest all the way to Dragon Quest XI.
    Samsung 98" TV for $1997.99
    SAMSUNG 98-Inch Class 4K Crystal UHD DU9000 Series HDR Smart TV
    This Samsung 98-Inch Class 4K Crystal DU9000 Series TV has hit an all-time low this weekend, priced at $1997.99. Not everyone has room for a massive 98" TV, but if you do, this is a solid option, especially for the price.
    Apple Watch Series 10 for $359
    Apple Watch Series 10 [GPS 46mm case] Smartwatch
    Amazon has the Apple Watch Series 10 on sale for $359 this weekend, which nets you $70 off this extremely popular device. Series 10 marked Apple's first wide-angle OLED display on Apple Watch, with the device itself being the thinnest watch yet. If you're not yet an Apple Watch owner, or you have an older model, this is the perfect time to score an upgrade.
    Persona 5 Royal for $14.88
    Persona 5 Royal - Nintendo Switch (Digital)
    Walmart has digital Nintendo Switch copies of Persona 5 Royal on sale for only $14.88. Acting as the definitive version of P5, Persona 5 Royal is one of the must-play RPG experiences of the last generation. This game offers well over 100 hours of content, making this an excellent deal.