Register and share your invite link to earn from video plays and referrals.

Search results for hunyuan
hunyuan community
One keyword maps to one global community path.
Create community
People
Not Found
Tweets including hunyuan
I was paying for Meshy. Then I found this repo and cancelled my subscription the same day. Modly is a free, open source desktop app that turns any photo into a 3D model using local AI running entirely on your GPU. No cloud. No account. No credits. Drop in a photo. The app removes the background automatically. A local image-to-3D model reconstructs the mesh on your machine. Export as GLB, OBJ, STL or PLY. → Runs Hunyuan3D 2 Mini by Tencent locally, with Microsoft's Trellis and Tripo AI's TripoSG coming next → Unlimited generations, your images never touch a server → Built-in 3D viewer, collections workspace, multiple export formats → Works with Unity, Unreal, Godot, Blender, ZBrush, Cura, PrusaSlicer, Bambu Studio → Hackable extension system, swap the model or modify the pipeline yourself → Windows and Linux installers ready, macOS coming soon Cloud services charge $20 to $60 a month and upload every photo you give them. This does the same job for free on hardware you already own. 3.5k stars. MIT License. 100% Opensource.
Show more
🏠 Describe a room in plain text, and out comes a complete, physics-ready scene a robot can actually interact with. That's SceneSmith, an ICML 2026 Spotlight from MIT and Toyota Research Institute. Title: nepfaff/scenesmith (SceneSmith) URL: 🏠 Overview SceneSmith is an agentic system that generates simulation-ready indoor scenes from natural language. It produces furniture, wall-mounted mirrors and artwork, ceiling chandeliers, and small tabletop items — all with physical properties like mass and inertia — so the scenes can be used directly for robot training and evaluation. ❓ Challenges Solved Building realistic indoor scenes for robot simulation has meant manual modeling or tedious scene composition, a major bottleneck for scaling robot evaluation and training. SceneSmith removes this by automatically generating diverse, contextually coherent scenes from text prompts. 💡 Methodology & Approach Scene generation runs as a five-stage sequential pipeline. ・Floor plan generation (walls and floor layout) ・Large furniture placement ・Wall-mounted objects (mirrors, artwork, shelves, clocks) ・Ceiling fixtures (chandeliers, pendant lights, ceiling fans) ・Manipulable small objects Checkpoints are saved automatically after each stage, so you can resume or branch midway. Scene reasoning and task decomposition use a VLM agent (GPT-5). 🎯 Use Cases & Tech ・3D assets are generated with the high-quality SAM3D (recommended) or Hunyuan3D-2, with retrieval from HSSD and Objaverse also supported ・AmbientCG PBR materials are applied via CLIP-based semantic search, and articulated objects from ArtVIP and PartNet-Mobility are handled with joint kinematics ・Output is native Drake format, with export to MuJoCo, USD, and Isaac Sim 📊 Highlights ・From a task like "find a fruit from the bowl and place it on a plate," it generates multiple constrained scene variations and supports robot evaluation ・A 151-word prompt yields a community center, even inferring context like placing ping pong paddles and balls near the table ・Geometry generation is distributed across GPUs, with bubblewrap isolation preventing rendering OOM #Robotics# #SceneGeneration#
Show more