Research Scientist, Interpretability

Company: Anthropic Limited
Location: San Francisco
Posted on: April 26, 2024

Job Description:

When you see what modern language models are capable of, do you wonder, "How do these things work? How can we trust them?"
The Interpretability team at Anthropic is working to reverse-engineer how trained models work because we believe that a mechanistic understanding is the most robust way to make advanced systems safe. We're looking for researchers and engineers to join our efforts.
People mean many different things by "interpretability". We're focused onmechanistic interpretability, which aims to discover how neural network parameters map to meaningful algorithms. If you're unfamiliar with this type of research, you might be interested in, or. (For a broader overview of work in this space, one of our team's alumni maintains a.)
Some useful analogies might be to think of us as trying to do "biology" or "neuroscience" of neural networks, or as treating neural networks as binary computer programs we're trying to "reverse engineer".
Some of our team's notable publications include,, and. This work builds on ideas from members' work prior to Anthropic such as the,,, and.
We aim to create a solid foundation for mechanistically understanding neural networks and making them safe (see our). In the short term, this means a we focus a lot of our attention on the issue of "superposition" (see,, and our). But this is just a stepping stone towards our goal of mechanistically understanding neural networks.

Responsibilities:

Develop methods for understanding LLMs by reverse engineering algorithms learned in their weights
Design and run robust experiments, both quickly in toy scenarios and at scale in large models
Build infrastructure for running experiments and visualizing results
Work with colleagues to communicate results internally and publicly

You may be a good fit if you:
- Have a strong track record of scientific research (in any field), and have donesomework on Interpretability
- Enjoy team science - working collaboratively to make big discoveries
- Are comfortable with messy experimental science. We're inventing the field as we work, and the first textbook is years away
- You view research and engineering as two sides of the same coin. Every team member writes code, designs and runs experiments, and interprets results
- You can clearly articulate and discuss the motivations behind your work, and teach us about what you've learned. You like writing up and communicating your results, even when they're null
  
  Familiarity with Python is required for this role.
  
  #J-18808-Ljbffr

Keywords: Anthropic Limited, Arden-Arcade , Research Scientist, Interpretability, Other , San Francisco, California

Click here to apply!

Didn't find what you're looking for? Search again!

Let San Francisco recruiters find you. Post your resume for free!

Get San Francisco Other jobs via email.

View more Arden-Arcade Other jobs

Other Other Jobs

Principal Secure System on Chip Architect
Description: Job DescriptionAt Boeing, we innovate and collaborate to make the world a better place. From the seabed to outer space, you can contribute to work that matters with a company where diversity, equity and (more...)
Company: BOEING
Location: San Ramon
Posted on: 01/1/1970

Medical Assistant - Urgent Care-Walnut Creek - Per Diem - 8 Hour - Variable Shift
Description: Job Description:The Medical Assistant works with the physician and other members of the primary or specialty care team by performing a variety of clinical and administrative patient-related duties, in (more...)
Company: John Muir Health
Location: Walnut Creek
Posted on: 01/1/1970

Remote Family Nurse Practitioner (Field visits required)
Description: JOB DESCRIPTION br Job Summary br The Care Connections Nurse Practitioners focus on screening and preventive primary care services delivered in the home, community, and nursing facility settings. (more...)
Company: Molina Healthcare
Location: Colusa
Posted on: 01/1/1970

Salary in Arden-Arcade, California Area | More details for Arden-Arcade, California Jobs |Salary

Travel Ultrasound Technologist ($2700-$3100/Week)
Description: Travel Ultrasound Technologist 2700- 3100/Week Company: Vetted Health States: All 50 states nationwide Overview:
Company: Vetted Health
Location: Newark
Posted on: 01/1/1970

Hair Stylist - Monte Vista Crossing
Description: Join a locally owned Great Clips - salon, the world's largest salon brand, and be one of the GREATS Whether you're new to the industry or have years behind the chair---great opportunities await Interested (more...)
Company: Great Clips
Location: Turlock
Posted on: 01/1/1970

Cardiac/Vascular Sonographer
Description: Job Description:Under the direct supervision of the site Physician, provides health care services to assist in the diagnosis and treatment of disease. Produces images of heart muscle and functioning by (more...)
Company: John Muir Health
Location: Walnut Creek
Posted on: 01/1/1970

Sr. Therapeutic Area Specialist, Hematology (Sr. TAS) - Sacramento, CA
Description: Working with UsChallenging. Meaningful. Life-changing. Those aren't words that are usually associated with a job. But working at Bristol Myers Squibb is anything but usual. Here, uniquely interesting (more...)
Company: Disability Solutions
Location: Carson City
Posted on: 01/1/1970

Travel Ultrasound Technologist ($2700-$3100/Week)
Description: Travel Ultrasound Technologist 2700- 3100/Week Company: Vetted Health States: All 50 states nationwide Overview:
Company: Vetted Health
Location: Lincoln
Posted on: 01/1/1970

Part-Time Trash Collector - Nights
Description: Join our team as an Apartment Trash Collector Service Valet and enjoy the convenience of flexible evening hours in your local area. Whether you're looking to supplement your income or earn extra money (more...)
Company: Disability Solutions
Location: San Rafael
Posted on: 01/1/1970

Travel Ultrasound Technologist ($2700-$3100/Week)
Description: Travel Ultrasound Technologist 2700- 3100/Week Company: Vetted Health States: All 50 states nationwide Overview:
Company: Vetted Health
Location: Rocklin
Posted on: 01/1/1970

Loading more jobs...

Research Scientist, Interpretability

Didn't find what you're looking for? Search again!

Other Other Jobs

Log In or Create An Account