WebArena: A Realistic Web Environment for Building Autonomous Agents

Shuyan Zhou^1*, Frank F. Xu^1*,
Hao Zhu¹⁺, Xuhui Zhou¹⁺, Robert Lo¹⁺, Abishek Sridhar¹⁺,
Xianyi Cheng¹, Tianyue Ou¹, Yonatan Bisk¹, Daniel Fried¹, Uri Alon¹, Graham Neubig^1,2.

¹Carnegie Mellon University, ²Inspired Cognition
^*Lead contributors. ⁺Equal contribution.
{shuyanzh,fangzhex,gneubig}@cs.cmu.edu

Paper Code Data Docker Environment Leaderboard

Our new benchmark TheAgentCompany!

WebArena is a standalone, self-hostable web environment for building autonomous agents. WebArena creates websites from four popular categories with functionality and data mimicking their real-world equivalents. To emulate human problem-solving, WebArena also embeds tools and knowledge resources as independent websites. WebArena introduces a benchmark on interpreting high-level realistic natural language command to concrete web-based interactions. We provide annotated programs designed to programmatically validate the functional correctness of each task.

WebArena Website Demos

The videos demonstrate various tasks that can be performed in WebArena.

Try it yourself:

Social Forum Online Shopping Content Management (u: admin, p: admin1234) Collaborative Software Development Map Wiki Calculator Scratchpad

Agent on Gitlab

"Set up a new, empty repository with the name awesome_llm_reading"

Agent on Shopping Website

"Tell me the status of my latest order and when will it arrive"

Realistic Tasks on WebArena

A high-level task that can be fully executed in WebArena. Completing such tasks requires sophisticated, long-term planning and reasoning capability. To accomplish the goal stated on the top, an agent needs to find out what art museums are located in Pittsburgh by searching Wikipedia. Next, it should identify the location of each museum on a map, optimizing the itinerary based on the information collected. Finally, the agent needs to update the README file in the appropriate repository with the planned route.

List of Tasks

Observation Space

We design the observation to be the URL and the content of a web page, with options to represent the content as a screenshot (left), HTML DOM tree (middle) and accessibility tree (right).

Evaluating Functional Correctness

We introduce two evaluation approaches. The top row measures the correctness of performing information seeking tasks. It compares the predicted answer with the annotated reference with three implementations. The bottom row programmatically checks whether the intermediate states during the executions possess the anticipated properties specified by the intent.

Related Work

The comparison between our benchmark and existing benchmarks on grounding natural language instructions to concrete executions. Our benchmark is implemented in our fully interactable highly-realistic WebArena environment. It features diverse tasks human may encounter in their daily routines. We design evaluation metrics to access the functional correctness of task executions.

BibTeX

@article{zhou2023webarena,
  title={WebArena: A Realistic Web Environment for Building Autonomous Agents},
  author={Zhou, Shuyan and Xu, Frank F and Zhu, Hao and Zhou, Xuhui and Lo, Robert and Sridhar, Abishek and Cheng, Xianyi and Bisk, Yonatan and Fried, Daniel and Alon, Uri and others},
  journal={arXiv preprint arXiv:2307.13854},
  url={https://webarena.dev},
  year={2023}
}

Subscribe to the newsletter of OneStopMarket
Tell me the the number of reviews that our store received by far that mention term "best"
What's the closest national park to the largest city in Maine?
Cancel order 307
Measure distance between Carnegie Music Hall and UPMC Shadyside by walking
Check if the duquesne university in pittsburgh can be reached in one hour by car from pittsburgh airport
I recently moved, my address is 654 Aspen Road, House #3, Boston, MA, 02110, update my information on OneStopShopping accordingly
Show me the path and travel time from home of the 1980 Super Bowl champions to home of the 1991 Super Bowl champions.
Tell me the coordinates of Apple Store near Pitt in DD format
Create a repo named nolan_honest_fans with movies directed by Christopher Nolan in a README file
Compare the payment difference of the last 4 cancelled orders and completed orders
Set my gitlab status as Enjoying life.
From my stay at La Quinta Inn near the airport, what's the estimated driving time to reach Carnegie Mellon University?
Add a simple product named Energy-Bulk Man Yoga Pant with 50 in stock, available in size 38 and color yellow, priced at $69.99
Add this product to my wishlist
Summarize customer reviews for Amazon Echo Dot 3rd generation.
Show me the way from Carnegie Mellon University to the home stadium of NYC NBA team
What is the total count of Not Approved reviews amongst all the reviews?
Show me the email address of the customer who is the most unhappy with the style of Zoe products
Add a new color option brown to the size S of Phoebe Zipper Sweatshirt
What is the website of Carnegie art museum in pittsburgh
Follow ['Jakub Klinkovsk', 'convexegg', 'Vinta Chen', 'yjlou', 'Abishek S'] on Gitlab
Add a white computer desk to my wish list.
Get the customer name of the earliest fraud suspect order
How many commits did Eric and Kilian make to a11yproject on 1/3/2023?
Tell me the the number of reviews that our store received by far that mention term "satisfied"
Tell me the full names of the repositories where I made contributions and they got no stars?
Find the page of the place in Pennsylvania where a plane crashed during the September 11th attacks on the map.
I am arriving at Pittsburgh Airport. Show me the name of a Hyatt hotel if there is any nearby. Tell me the names of supermarkets that are within 15mins driving from the hotel
I previously ordered some a mattress foundation around Feb or March 2023 and later cancelled. Can you reorder it for me?
Show me the way from Carnegie Mellon University to the home stadium of Yankees in the 80th
Find the resturants around CMU ArtPark Lab
Find a GitLab repository related to gan implementation and make a Reddit post linking to it in a relevant subreddit
Like all submissions created by Hrekires in subreddit news
List the top 2 search terms in my store
I have a lot of Nintendo Switch game cards now, help me find the best storage option to fit all 31 cards
Show me products under $30 in "men shoes" category
How much I spent on cooking and food shopping during March 2022
Buy the highest rated product from the Beauty & Personal Care category within a budget under 20.
Fill the "contact us" form in the site for a refund on the bluetooth speaker I bought, stating that it broke after just three days of use. Also, ensure to include the order number #161 and the product SKU. Don't submit yet, I will check.
Change my reddit bio to "Pro Python Developer with 20 years of Experience"
Buy the highest rated product from the Ceiling light category within a budget above 1000.
What is the minimum travel time by car from Schenley park to Upitt?
Tell me the coordinates of Tokyo Japanese Food Store in Pittsburgh in DD format
Delete all pending negative reviews for Circe fleece
What is the estimated driving time between the hometown of Joe Biden and Bridgeport?
How much I spend in March 2023 on shopping at One Stop Market?
Which customer has placed 2 orders in the entire history?
Open the thread of a trending post on the forum "consoles" and subscribe.
What is the zip code of Yale University?
Cancel order 301
Among the top 10 post in "books" forum, show me the book names from posts that recommand a single book
Fork the Pytorch GAN repo with most stars.
Add a white desk to my wish list.
Tell me the count of comments that have received more downvotes than upvotes for the user who made the latest post on the DIY forum.
Set my gitlab status as Playing Badminton.
Draft an email to the shop owner via their contact us function for a coupon as I plan to make a bulk purchase
Create an orders report from beginning of May 2021 to end of March 2022
Notify Jane Doe in their most recent pending order with message "sorry we are out of stock, please reorder"
Set up a new, empty repository with the name awesome_program_aided_reasoning?
Invite Jakub K, Alex Dills, Alex Hutnik and Benoît Blanchon as collaborator to my time tracking tool project repo
Create a coupons report from 05/01/2021 to 05/15/2023
Which US states border Massachusetts?
List out reviewers, if exist, who mention about ear cups being small
Gather the titles of Doc and Pies Arcade Factory Cocktail Arcade Machine reviews with 3 stars and less rating from OneStopShop, and post them in the games subreddit under the title "real user feedback on Doc and Pies Arcade Factory Cocktail Arcade Machine"
How long does it take to walk from Carnegie Mellon University to Univ of Pittsburgh?
Create a milestone for the upcoming practice of collective code review starting on 1/16/2023 and ending on in 20 days
Post a notice on a virtual meetup for book reading enthusiasts on March 15th in the r/books subreddit
Post in the most appropriate subreddit and ask for recommendations for sony headphones products within a budget of $500
Add new size 30 and 31 to all color variants of Diana Tights
Show the most recent out of delivery order
Change the page title of "About us" page on my site to "Secret".
Post in dataisbeautiful forum about what could large language models help the correpong field.
Start a private project AGISite with JEKYLL template and add Rohan and Vinta as members
Reply to the post with my comment "???"
Among the top 10 post in "books" forum, show me the author name and the book name from posts that recommand a single book
Tell me the reasons why customers like Circe hooded fleece
Get the total payment amount of the last 2 completed orders
Tell me the closest cafe(s) to CMU Hunt library
How many commits did Eric make on 3/2?
Post a review of my recent reading "Gone with the wind" in the r/books with my comment "It's a book with history".
Post a notice on a virtual meetup for Big little lies enthusiasts on Sep 10th in the books subreddit
Like all submissions created by CameronKelsey in subreddit earthporn
List all opened issues that don't have any labels
Add the product with the lowest per unit price from my open tabs to the shopping cart
Compare the time for walking and driving route from AMC Waterfront to Carnegie Mellon University
Tell me the the number of reviews that our store received by far that mention term "decent"
Search for "switch accessories"
How many commits did Kilian make durning 2023?
Draft a refund message via their "contact us" form for the bluetooth speaker I bought Feb 2023. It broke after three days of use. The shop requires the order id, the reason and the amount to refund in the message. Don't submit yet
Show me the route and driving time from Allentown, PA to the city where my E-commerce customer Amanda Kim lives
Gather the titles of Racing Wheel Overdrive for Xbox X reviews with 1 star rating from OneStopShop, and post them in the games subreddit under the title "real user feedback on Racing Wheel Overdrive for Xbox X"
Add this product to my wishlist
Fork 2019-nCov.
Add Light Blue Simple Summer New Low Heels Slippers for Women Fashion Chunky Heels Pointed Toe Wine Glasses Sandals Comfortable Walking Shoes Ladies All-Match Sexy Party Shoes to my wish list
Create a private JEKYLL repository called "11711_gitlab" using the right template to speed up development.
Find the customer name and email with phone number 8015551212
I am doing a market survey for one stop market, show me the most expensive product from Household Supplies category
List out reviewers, if exist, who mention about good fingerprint resistant
Gather the titles of Sony Computer Entertainment VR reviews with 2 stars and less rating from OneStopShop, and post them in the games subreddit under the title "real user feedback on Sony Computer Entertainment VR"
Add a new size XXS to blue and purple Nona Fitness Tank
Display the list of issues in the keycloak/keycloak repository that have labels related to flaky-test
I will arrive Pittsburgh Airport soon. Provide the name of a Hilton hotel in the vicinity, if available. Then, tell me the the walking distance to the nearest supermarket own by a local company from the hotel.
Like all submissions created by ThetaGang_wsb in subreddit wallstreetbets
Make all Aeno capri as out of stock
Delete all reviews from the scammer Carlo
Check out my todos
DisLike all submissions created by Hrekires in subreddit news
5 blue Cronus yoga pants with size 33 arrived, update the stock
Tell me the reasons why customers like Ana Running Short
Provide me with the full names of chargers from Anker, and also share the price range for the available models
Create a private HTML repository called "web_agent_index" using the right template to speed up development.
Modify the address of order #65 to 789 Pine Lane, San Francisco, CA, 94102
How much refund I should expect from my order canlled in 2022, including shipping fee
I have a lot of Nintendo Switch game cards now, help me find the best storage option to fit all 40 cards
What brands appear most frequently among the top search terms?
Show me the email address of the customer who is the most unhappy with Circe fleece
Disable Teton pullover hoodie from the site, they are facing some quality issues.
Change the delivery address for my most recent order to 3 Oxford St, Cambridge, MA.
Where is the nearest gas station from CMU
Checkout merge requests requiring my review
Compare the difference in time for walking and driving route from Randyland to Carnegie Mellon University
Where is the nearest Starbucks to Carnegie Mellon, and what is the walking distance to it?
Update order #304 with the USPS tracking number 13849373987
Create a folder named real_space in gimmiethat.space repo. Within it, create a file named urls.txt that contains the URLs of the 5 most recent posts from the space?
What are the main criticisms of this product? Please extract the relevant sentences.
Delete all negative reviews for Sybil running short
Increase the price of black fitness tshirts from Desiree with size XS by 37%
Given the following locations, ['Princeton University', 'Yale University', 'Harvard University'], what would be the optimal route to travel through them all in order to minimize total travel time? Please note the journey begins at the first place listed.
Show me the command to clone metaseq with SSH.
List the customer names who complain about the quality of EYZUTAK phone cases
Increase the price of all blue running tshirts in extra small and small sizes by 23%
Lookup orders that are processing
Tell me the full names of the repositories where I made contributions and they got more than 100 stars?
Re-post the image of costume contest in this page to funny subreddit and note "from /f/pics"
Cancel order 302
How much refund I should expect from my order canlled in May 2023 if I cannot get the shipping fee refunded?
Get the billing name of the oldest complete order
What is the phone number of Western Pennsylvania Hospital
Delete all reviews from the scammer Arden
Abishek wants to check my dotfile configurations. Please invite him to the repo as a guest.
Add the following users to my GitHub timeline item management extension as maintainer: ['abisubramanya27', 'lahwaacz']
Tell me the email address, name, phone number of the customer who has the most cancellations in the history
Reply to the first reply in this post with ""don't panic""
I want to browse the products in the Video Game category
Open my latest created issue that has feature in its title to check if it is closed
Open an issue to ask their plan on supporting Llama and other llama family models in metaseq.
Update the project site's title to "Title Wanted"
Post "close because non reproducible" for the merge request related to focus edge cases in a11yproject/a11yproject.com project
Tell me the full names of the repositories where I made contributions and they got the most stars?
Create a new public project "awesome-llms" and add primer, convexegg, abishek as members
Among the top 10 post in "books" forum, is there any post talks about supporting local book stores? If so, tell me the organizations involved
Create a repo named nolan_young_fans with movies directed by Christopher Nolan after 2010 in a README file
Create a discussion post about "Harry Potter movie series" in a relevant subreddit and ask users for their opinions with the simple prompt, "your opinion"
Find a subreddit focused on topics related to ML, DL, NLP, and post my question, "what is the SOTA web navigation agent repo" there
Create a product view report from 07/05/2021 to 05/31/2023
What is the duration required to first walk from Carnegie Mellon University to Starbucks on Craig Street, and then drive to Pittsburgh International Airport?
Fork MetaSeq.
Compare the time for walking and driving route from 5000 Fifth Avenue, Pittsburgh to UPMC family health center
Modify the address of order #125 to 654 Elm Drive, Apartment 12, Miami, FL, 33101
How long does it take to walk from the starbuck near CMU to Chatham university?
Presents the monthly count of successful orders from May to December 2022 in MM:COUNT format
Create a new forum named cmu_lti, with a description of Language Technologies Institute at Carnegie Mellon University, and include ['announcement', 'paper', 'alumni'] in the sidebar?
From my stay at red roof inn, what's the estimated driving time to reach Pittsburgh science museum?
Invite Benoît and Abishek as collaborator to my HTML5 markup extention repo
Search for "batteries for iphone 13"
Tell me when I last ordered my muffin cornbread mix?
What are the key aspects that the customers don't like about Antonia Racer Tank
Tell me the full address of all international airports that are within a driving distance of 50 km to Carnegie Mellon University
Today is 3/15/2023, generate a refund report for Q1
Lookup orders that are on hold
Start a private project project_site with NodeJS template and add primer, convexegg, vinta as members
How many commits did Anthony make between 08/2022-09/2022?
How many reviews our shop received by far?
Star the top three most stared repos in Gitlab
Show me products under $46.99 in "makeup remover" category
Make the LICENSE of gimmiethat.space and dotfiles to MIT license.
DisLike all submissions created by AdamCannon in subreddit UpliftingNews
What's the total number of items sold in the most recent 4 orders?
What are the top-5 best-selling product in 2023
Tell me the name of the customer who has the most cancellations in the history
What is the estimated driving time between the big apple and the city with the most authentic Philly cheesesteaks?
Tell me who has made the most contributions, in terms of number of commits, to the primer/design project
Show the least expensive ssd hard drive with a minimum storage capacity of 1TB.
Presents the monthly count of successful orders from Feb to Nov 2022 in MM:COUNT format
Promote lahwaacz/arch-wiki-docs to subreddit science with the description from the repo itself.
Search for "green tea bag for weight loss"
Re-post the image of Bald Eagle in this page to earthporn subreddit and note "from /f/pics"
Ask for advice about sexual harassment in a subreddit for relations
List the last names of the top 3 contributors to 2019-nCov repo, ranked by the number of commits?
Show me the walking distance from nearby hotels to Gardner Steel Conference Center, that take at most 5 minutes?
Today is 3/15/2023, generate a sales order report for last year
I recently moved, my address is 987 Sycamore Circle, Philadelphia, PA, 19102, update my information on OneStopShopping accordingly
Draft an email to the shop owner via their contact us function for a coupon as I am a loyal customer
List all opened issues that report bugs
Check out the most recent open issues
Make all Gobi HeatTec Tee as out of stock
Create a new public project "web_arena" and add Abishek, Vinta as members
Mark all Hollister shirts on sale
Star the top four most stared repos in Gitlab
Show me the path and travel time from the big apple to biggest city in Maine.
Open my latest created issue that has theme editor in its title to check if it is closed
What is the rating of Ugreen lightning to 3.5mm cable
Assign the issue regarding flash alerts to myself and primer.
Given the following locations, ['Carnegie Mellon University', 'apple store shadyside', 'starbucks on craig street'], what would be the optimal route to travel through them all in order to minimize total travel time? Please note the journey begins at the first place listed.
Tell me the coordinates of Western Pennsylvania Hospital Heliport in DD format
Post a review of my recent reading "Harry Potter" in the r/books with my comment "Wonderful journey".
I will arrive Pittsburgh Airport soon. Provide the name of a Hilton hotel in the vicinity, if available. Then, tell me the the shortest walking distance to a supermarket from the hotel.
Today is 3/15/2023, generate a tax report for this year
Notify Sarah Miller in their most recent pending order with message "the order is ready to be shipped soon!"
Invite yjlou as collaborator to solarized-prism-theme
create a new group "n-lab" with members patou, egpast, westurner, jontutcher
Compare the time for walking and driving route from AMC Waterfront to Univ of Pittsburgh
Create a new forum named Cyberpunk, with a description of Welcome to the future, and include ['Games', 'Books', 'Movies', 'Future'] in the sidebar?
Reduce the price of size 28 Sahara leggings by 13.5%
Fill the "contact us" form in the site for a refund on the speaker I bought, stating that it broke after just three days of use. Also, ensure to include the order number #148 and the product SKU. Don't submit yet, I will check.
What is the duration required to first walk from Univ of Pittsburgh to starbucks on Craig Street, and then drive to Pittsburgh International Airport?
Fork all repos from facebook.
Open an issue to report experiencing "OSError: [Errno 98] Address already in use" during executions in aem-hacker.
Add a toothpaste to my wish list.
I previously ordered some a make up removal kit during summer 2022 and later cancelled. Can you reorder it for me?
Tell me the total cost of my latest pending order?
Delete all pending negative reviews
We've received 12 white Cora parachute pant of size 28 and 56 blue of size 29, update the inventory.
From my stay at Homewood Suites Southpointe, what's the estimated driving time to reach PPG Paints Arena?
I have a lot of Nintendo Switch game cards now, help me find the best storage option to fit all 6 cards
From my stay at DoubleTree by Hilton New York Downtown, what's the estimated driving time to reach Keens Steakhouse?
Add a simple product named Swaatch Smart Watch with 42 in stock, available in size uni-size and color Blue, priced at $769.99
What is the price configuration of the fake tree I bought Jan 2023
Change the delivery address for my most recent order to 77 Massachusetts Ave, Cambridge, MA.
Find the page of the undergrad college of the person who developed the Nash equilibrium on the map.
I recently moved, my address is 231 Willow Way, Suite 100, Chicago, IL, 60601, update my information on OneStopShopping accordingly
List out reviewers, if exist, who mention about under water photo
Create a folder named news in gimmiethat.space repo. Within it, create a file named urls.txt that contains the URLs of the 5 most recent posts from the news related subreddits?
set the homepage URL on my GitLab profile to https://egg.tart.com/
Presents the monthly count of successful orders from Jan to Nov 2022 in MM:COUNT format
Make all Selene yoga hoodie as out of stock
Buy the highest rated product from the meat substitute category within a budget between 100 and 200.
Find the customer name and email with phone number +1 2058812302
Which customer has completed the fifth most number of orders in the entire history?
How much I spent on home decoration shopping during 1/29/2023
Make the LICENSE of byteblaze/a11y-syntax-highlighting to one that mandates all copies and derivative works to be under the same license
Make the LICENSE of byteblaze/dotfiles to MIT license.
create a new group "webagent" with members pandey2000, sayakpaul, sayakpaul
What is the color configuration of the picture frame I bought Sep 2022
create a repository named Awesome_DIY_ideas that includes a README file with the links to the most active 6 DIY ideas on DIY subreddit?
I want to browse the products in the Headphones category
DisLike all submissions created by RickyDontLoseThat in subreddit massachusetts
Add 2 Hawaiian Bamboo Orchid Roots #zc50 - by Discount Hawaiian Gifts to my wish list
Update the product description of Bella Tank to highlight the real user positive reviews by quoting the comments
Create an issue in a11yproject repo with title "401 bad gateway". Assign the issue to Roshanjossey. Set due date to be the end of 2030
Create a discussion post about "the effectiveness of online learning" in a relevant subreddit and ask users for their opinions with the simple prompt, "your opinion"
Get the total payment amount of the last 5 non-cancelled orders
Rate my recent purchase of Foundation For Mattress With Frame Set with 1 stars, using my nickname ShoppingEmma?
Ask for advice about deal with long-distance relationships in a subreddit for relations
Tell me the full address of all international airports that are within a driving distance of 5 km to Carnegie Mellon University
Tell me the closest restaurant(s) to CMU Hunt library
Update the description of Radiant Tee to highlight the real user positive reviews by quoting the comments
What is the price range for products from sephora?
Show me the "Canon photo printer" listings by search relevance, from most to least.
Today is 3/15/2023, generate a sales order report over the last 45 days
Increase the price of white Ingrid Running with size L and above by $17
List the full product names of slide slippers from Nike and tell me the price range of the available products
Tell me the count of comments that have received more downvotes than upvotes for the user who made the latest post on the Worcester forum.
Get the customer name of the most recent cancelled order
Post my question, "is car necessary in NYC", in a subreddit where I'm likely to get an answer
List products from living room furtniture category by descending price
How many commits did Steven Woodson make to a11y-webring.club on 2/6/2023?
Tell me the full names of the repositories where I made contributions and they got less than 5 stars?
Show me the "chairs" listings by ascending price.
Fill the "contact us" form in the site for a refund on the phone screen protector I bought, stating that it broke after just three days of use. Also, ensure to include the order number #000000180 and the product SKU. Don't submit yet, I will check.
Give me the SKU of the products that have 10 units left
Show me the customers who have expressed dissatisfaction with Circe fleece?
Approve the positive reviews to display in our store.
How much refund I should expect from my order canlled in 2022/03? I only kept the AC-DC Adapter and the shop told me that I cannot get the shipping fee back
I have jaw bruxism problem, show me something that could alleviate the problem.
Measure distance between Carnegie Mellon University and Carnegie Music Hall by walking
Add a laundry detergent to my wish list.
Buy the best rating product from "Home Audio Speaker" category with at least 5 reviews and the product is least expensive
Post "lgtm" for the merge request related to fixing the broken links in byteblaze/empathy-prompts project
Draft an email to the shop owner via their contact us function for a coupon as they promised me a coupon last time
Post in the most appropriate subreddit and ask for recommendations for noise-cancelling headphones products within a budget of $200
Show me the order statuses for order number 170 and 189.
What is the top-1 best-selling product in 2022
Promote byteblaze/dotfiles to subreddit aww with the description from the repo itself.
Tell me the total cost of my latest complete order?
Star the top eight most stared repos in Gitlab
Open my latest created issue that has dependency in its title to check if it is closed
Tell me the coordinates of bus stop on the Carnegie art museum side of the street near CMU in DD format
I am arriving at Carnegie Mellon University. Find the nearby US Citizenship and Immigration Services and the walking distance to the nearest Social Security Administration from US Citizenship and Immigration Services
Like all submissions created by FTorrez81 in subreddit iphone13
Give me the name of the products that have 0 units left
Get the order ID of the newest pending order
I am at CMU Pittsburgh, how long it takes to drive to the nearest Mcdonald's
create a repository named live_a_life that includes a README file with the links to the most active 3 DIY ideas on DIY subreddit?
Create a new public project "AutoAGI" and add primer as members
Create a new private project "llm_bulk_inference" and add primer, convexegg, abishek as members
Add the product with the lowest per unit price from my open tabs to the shopping cart
Pull up the description page of Whole Foods near Carnegie Mellon on Map
Preview the Magento Blank theme for my shop
Show me the way from Carnegie Mellon University to the home stadium of Philadelphia 76ers
Get the order number of my most recent pending order
Open my latest updated issue that has keyword "better" in its title to check if it is closed
What are the key aspects that the customers don't like about Zing Jump Rope
Tell me who has made the most contributions, in terms of number of commits, to the Pytorch GAN project
Re-post the image of Firework in this page to earthporn subreddit and note "from /f/pics"
Draft an email to the shop owner via their contact us function for a coupon as my refund is suppoed to be replaced by a coupon
Display the list of issues in the OpenAPITools/openapi-generator repository that have labels related to OpenAPI Generator CLI
Rate my recent purchase of Mini Wireless Bluetooth Speaker with 2 stars, using my nickname SimpleEmma?
Tell me the closest restaurant(s) to CMU Sorrells Library
List all opened issues requesting new features
I am doing a market survey for one stop market, show me the most expensive product from skin care tool category
Thumbs down the top 2 post ever in history.
What's the total number of items sold in the most recent 2 orders?
create a repository named fun_thing_to_do that includes a README file with the links to the most active 5 DIY ideas on DIY subreddit?
Submit a merge request for a11yproject.com/redesign branch to be merged into master branch, assign Justin Armstrong as the reviewer
Add this product to my wishlist
How many commits did Nic make in April 2021?
Thumbs down the top 4 post ever in movies.
How many commits did kilian make on 3/5/2023?
Go to the merge request on 404 link I have to review, find if the author of the merge request responded at the end, and reply "Thank you" if he did. Otherwise remind him with a simple @.
Post a review of my recent reading "big little lies" in the r/books with my comment "can't stop it".
set the homepage URL on my GitLab profile to a11yproject.contributor.me
Disable Cora Pant from the site, they are facing some quality issues.
Find a subreddit focused on topics related to city lives in DMV area, and post my question, "safe and budge apartment to live" there
Telll me the grand total of invoice 000000002.
I have a lot of Nintendo Switch game cards now, help me find the best storage option to fit all 23 cards
Reduce the price of green Hollister backyard sweater in all size by $5
Follow ['convexegg', 'yjlou'] on Gitlab
Add a new color blue to size S and M of Frankie Sweatshirt
What is the total count of Pending reviews amongst all the reviews?
Thumbs down the top 3 post ever in books.
What is the duration required to first walk from Carnegie Mellon University to apple store shadyside, and then drive to starbucks on craig street?
Update the product description of Antonia Racer Tank to highlight the real user positive reviews by quoting the comments
Tell me the distance to drive from Carnegie Mellon University to the top computer science school in massachusetts
Find the bar around Carnegie Music Hall
I am at CMU Pittsburgh, how long it takes to the nearest USPS postal office with different transportation methods?
Thumbs down the top 5 post ever in technology.
What is the price range for products from EYZUTAK?
Measure distance between Carnegie Mellon University and UPMC Shadyside by walking
Notify Lily Potter in their most recent pending order with message "Thanks, your order is ready to be shipped!"
Create a private Android repository called "web_agent_android" using the right template to speed up development.
Tell me the the number of reviews that our store received by far that mention term "disappointed"
Edit my post on Lord of the Rings by adding a line to the body that says "The cast is amazing!"
Show me the order date for order number 148.
Add this product to my wishlist
Get the order number of my most recent complete order
Delete all pending reviews with less than 4 stars
Reply to the post with my comment "Yeah, pittsburgh traffice, you know..."
Make all Taurus Elements Shell as out of stock
Change my reddit bio to "I am a robot"
Find a subreddit focused on topics related to gaming consoles, and post my question, "what is the recommended console to buy these days" there
Post my question, "safe and budge apartment to live in nyc", in a subreddit where I'm likely to get an answer
Show me the walking distance from nearby hotels to Pittsburgh airport that take at most 3 minutes?
Add the following users to repo kkroening/ffmpeg-python as maintainer: ['yjlou', 'a11yproject']
How many commits did Eric and Kilian make on 1/3/2023 in total?
Who else have access to my repo prism-theme, show me their usernames
I want to browse the products in the Cabinets, Racks & Shelves category
See all public projects
Edit my post on Star Trek by adding a line to the body that says "Every watch makes me feel like a kid again"
Set up a new, empty repository with the name awesome_webagent?
Notify Grace Nguyen in their most recent pending order with message "sorry we are bankrupt, please contact our customer service for refund"
Show me the product names for order number 148.
I am doing a market survey for one stop market, show me the most expensive product from nutrition bars and drinks category
Tell me the closest restaurant(s) to university center at Carnegie Mellon University
Create an issue asking about do they have any plan on supporting Webagent in the next quater in huggingface dataset.
What is the size configuration of the picture frame I bought 2022
Find the parking around CMU main campus
Set my gitlab status as Out of Office.
How long does it take to walk from Carnegie Museum of Art to a library at CMU?
set the homepage URL on my GitLab profile to https://helloworld.xyz/
Vinta wants to check my dotfile configurations. Please invite him to the repo as a guest.
Find the page of the college(s) where The Chair was filmed in Pennsylvania other than the ones in Pittsburgh on the map.
Create a private blank repository called "web_agent" using the right template to speed up development.
DisLike all submissions created by PatientBuilder499 in subreddit videos
Update the project site's title to "Not an interesting site"
Update order #306 with the UPS tracking number 55591023930
Find the customer name and email with phone number 555-229-3326
Make all rocco gym tank as out of stock
Show me the email address of the customer who is the most unhappy with Olivia zip jacket
Who gave 1 or 2 stars for phone cases from EYZUTAK
Look up the most recent models of XBox controllers released between 2020-2021?
Create a discussion post about "Fun thing to do in Pittsburgh" in a relevant subreddit and ask users for their opinions with the simple prompt, "your opinion"
Open an issue to request adding support for MT theme editor in a11y-syntax-highlighting.
Set up a new, empty repository with the name webagent?
Jakub Klinkovský wants to check my dotfile configurations. Please invite him to the repo as a guest.
How much refund I should expect from my order canlled in Feb 2023, including shipping fee
What is the minimum travel time by car from REI to CMU?
Add the product with the lowest per unit price from my open tabs to the shopping cart
Among the top 10 post in "books" forum, show me the post URLs that recommand a single book
Show me the billing address for order number 00178.
Measure distance between Carnegie Mellon University and CVS (closet one) by walking
Create an issue in a11yproject repo with title "404 for many URLs". Assign the issue to myself. Set due date to be 2030-1-3
How long does it take to walk from Univ of Pittsburgh to starbucks on Craig Street?
What are the key aspects that the customers don't like about Electra Bra Top
Add a simple product named FancyBoy Man Causal Jeans with 42 in stock, available in size 34 and color Blue, priced at $169.99
What is the estimated driving time between the city where the Liberty Bell is located and the home city of Pirates?
Create a new private project "planner" and add Abishek, Vinta as members
Get the order number of my most recent on hold order
Tell me the total spend on products in the most recent cancelled orders of the customer who has the most cancellations in the history
Show me products under $25 in "women shoes" category
Open my latest updated issue that has keyword "theme editor" in its title to check if it is closed
Update order #299 with the Federal Express tracking number 8974568499
Change the delivery address for my most recent order to 155 5th Street, San Francisco, CA.
What is the total count of Approved reviews amongst all the reviews?
Tell me who has made the most contributions, in terms of number of commits, to the csvkit project
Get me my RSS feed token
How long does it take to walk from Carnegie Mellon University to starbucks on Craig Street?
Post "lgtm" for the merge request related to semantic HTML post in a11yproject/a11yproject.com project
Post "Thanks, working on reviews" for the merge request related to octovisuals page in primer/design project
Which US states border New Hampshire?
Tell me the email address of the contributor who has the most commits to branch main
List products from competative swimwear category by ascending price
What is the price range of wireless earphone in the One Stop Market?
Fork all source repos from Akilesh Kannan
Tell me who has made the most contributions, in terms of number of commits, to the thoughtbot/administrate project
Start a private project awesome_web_agents with blank template and add Abishek, Vinta as members
List the customer names who thinks EYZUTAK phone cases are of good looking
What is the hours of operation of Tokyo Japanese Food Store in Pittsburgh
How many commits did Philip make in 2023/1?
Tell me the reasons why customers like Antonia Racer Tank
Show me the customers who have expressed dissatisfaction with Antonia racer tank?
find discounted items.
Open my latest created issue that has homepage content in its title to check if it is closed
Post a review of my recent reading "Love story" in the r/books with my comment "I cried".
List all opened issues that ask about OPT model related questions
Who gave 4 or 5 stars for phone cases from EYZUTAK
Make the LICENSE of byteblaze/accessible-html-content-patterns to Apache License
Create an issue in dotfiles repo with title "add support for oh-my-zsh". Assign the issue to Abishek. Set due date to be July 18 2033
Cancel order 299
Tell me the reasons why customers like Circe's products
Telll me the grand total of invoice 000000001.
Which US states border Vermont?
Create a milestone for the upcoming task of cleaning sensitive information starting on 2/16/2023 and ending on in 20 days
Tell me the count of comments that have received more downvotes than upvotes for the user who made the latest post on the photoshopbattles forum.
What is the minimum travel time by car from CMU to University of Pittsburgh?
Show the route from SCS CMU in Pittsburgh to the location where the Declaration of Independence and Constitution were signed
Get directions from Carnegie Science Museum to Hunt library CMU using walk options.
Get the date of the most recent canlled order
Ask for product recommendations for used iphone within a budget of $1000 in r/iphone
Check if the walmart in pittsburgh can be reached in one hour by car from 5600 fifth avenue
What is the zip code of Carnegie Mellon University?
Increase the price of this product by 15%
Find a subreddit focused on topics related to city Pittsburgh, and post my question, "places for new drivers to learn driving" there
Lookup orders that are suspected of being fraudulent
Upvote the newest post in deeplearning subreddit
Fill the "contact us" form in the site for a refund on the iphone case I bought, stating that it broke after just three days of use. Also, ensure to include the order number #180 and the product SKU. Don't submit yet, I will check.
Re-post the image of Thanksgiving turkey in this page to funny subreddit and note "from /f/pics"
Show me products under $199 in "furtiture with accent" category
Who is the operator of PIT airport
Find the page of the longest bridge in the Western hemisphere on the map.
Follow ['Jakub Klinkovský', 'Koushik', 'Vinta Chen'] on Gitlab
Open my latest created issue that has better in its title to check if it is closed
How much time does it take from Pittsburgh to Philadelphia by car?
Ask for advice about cheat in a subreddit for relations
Ask for product recommendations for running pants within a budget of $500 in r/sports
Draft a new marketing price rule for fall discount that offers $10 discount on checkout for all customers
Lookup orders that are completed
What is the top-1 best-selling brand in Quarter 1 2022
Post in books subreddit about what could machine learning help the correpong field.
Draft a new marketing price rule for Mother's day sale that offers $15 discount on checkout for all customers
create a repository named TODO that includes a README file with the links to the most active 10 DIY ideas on DIY subreddit?
Post in the most appropriate subreddit and ask for recommendations for used iphone products within a budget of $1000
Post in technology forum about what could open-source LLMs help the correpong field.
Notify Alex Thomas in their most recent pending order with message "Yo, your order will be shipped soon!"
Create a shipping report from 08/05/2022 to 03/01/2023
Open a new issue to discuss the implementation of dark mode
Post in DIY subreddit about what could midjourney help the correpong field.
Lookup orders that are canceled
Find the page of the colleges where The Chair was filmed in Pittsburgh on the map.
Find a subreddit focused on topics related to NYC, and post my question, "is car necessary" there
Cancel order 305
Show me products under $78 in "children dental care" category
How many reviews our shop received in May 2023?
Show me the name of the customers who have expressed dissatisfaction with Chloe tank
Post a review of my recent reading "To Kill a Mockingbird by Harper Lee" in the r/books with my comment "good book!".
Show me the name of the customer who is the most unhappy with Chloe tank
Find a GitLab repository related to chatGPT and make a Reddit post linking to it in a relevant subreddit
Get the total payment amount of the last 5 completed orders
Show me the name of the customer who is the most unhappy with Antonia racer tank
Preview the Magento Luma theme for my shop
Make a folder named car on the gimmiethat.space repo and include a file called urls.txt that consists of the links to the 5 most recent posts from cars.
Buy the highest rated product from the NS switch pouch category within a budget under 60.
Update the project site's title to "Welcome to my site"
What are the main criticisms of this product? Please extract the relevant sentences.
Upvote the newest post in explain like im 5 subreddit
Today is 6/12/2023. Tell me how many fulfilled orders I have over the past month, and the total amount of money I spent.
Show me the way from Carnegie Mellon University to the home stadium of Philadelphia 76ers in the 70th
We've received 378 brown Aero daily fitness tee in every size, please update the inventory.
List the name and number of commits of the top 3 contributors to metaseq repo, ranked by the number of commits?
I want to browse the products in the Men shoes category
What's the closest national park to Boston? How far is it to drive there?
Draft a new marketing price rule for Thanks giving sale that offers $40 discount on checkout for all customers
I will arrive Pittsburgh Airport soon. Provide the name of a Hyatt hotel in the vicinity, if available. Then, tell me the the minimal driving time to a supermarket from the hotel.
Find the walkway to the closest chain grocessory owned by a local business from 401 Shady Ave, Pittsburgh.
Give me the brand of the products that have 3 units left
Modify the address of order #299 to 456 Oak Avenue, Apartment 5B, New York, NY, 10001
Show me the shipping method for order number 187.
List the top 1 search terms in my store
How many commits did kilian make to a11yproject on 3/5/2023?
Change my reddit bio to "Seeking SDE positions"
Go to the merge request on wcag I have to review, find if the author of the merge request responded at the end, and reply "Thank you" if he did. Otherwise remind him with a simple @.
set the homepage URL on my GitLab profile to www.byteblaze.com
Get the order number of my most recent order
What is the color configuration of the artifical plants I bought Feb 2023
Draft a refund message via their "contact us" form for the phone case I bought March 2023. It broke after three days of use. The shop requires the order id, the reason and the amount to refund in the message. Don't submit yet
Koushik wants to check my dotfile configurations. Please invite him to the repo as a guest.
Set my gitlab status as Resting due to leg injury.
Post in the most appropriate subreddit and ask for recommendations for DIY toolkit products within a budget of $100
Create a private NodeJS repository called "web_agent_nodejs" using the right template to speed up development.
What do customers say about brush from sephora
I previously ordered some a cat t-shirt during 2022 and later cancelled. Can you reorder it for me?
Disable Karmen yoga pants from the site, they are facing some quality issues.
List out reviewers, if exist, who mention about complain of the customer service
Add a chair to my wish list.
Open an issue to ask their plans on adding Python 3.11 related resources in awesome-python.
How many reviews our shop received during 2022?
What's the total number of items sold in the most recent 5 orders?
Create a new forum named Karaoke, with a description of Place for Karaoke lovers, and include ['devices', 'setup'] in the sidebar?
Start a private project web_agent_android with Android template and add primer, convexegg, abishek as members
Update the project site's title to "GIVE ME SPACE"
Get directions from Carnegie Music Hall in NYC to Carnegie Mellon University using driving options.
Update the description of Selena Yoga Hoodie to highlight the real user positive reviews by quoting the comments
Give me the SKU of the products that have 1-3 units left
Post a notice on a virtual meetup for racing cars enthusiasts on Oct 21st in the nyc subreddit
How much did I spend on shopping at One Stop Market on November 2022? They gave me a 20% discount on the total amount for orders exceeding $200 in cash
Get the order number of my most recent cancelled order
Submit a merge request for a11yproject.com/redesign branch to be merged into markdown-figure-block branch, assign myself as the reviewer
Assign the issue regarding 404 in a11yproject to myself.
Create a new forum named sci_fi, with a description of A wild place for sci-fi enthusiasts, and include ['New', 'Classic', 'Movies', 'Post my novel', 'Random'] in the sidebar?
Upvote the newest post in future technology subreddit
Ask for product recommendations for running shoes within a budget of $500 in r/sports
Who else have access to my repo gimmiethat.space, show me their usernames
Invite Jakub Klinkovský and Benoît Blanchon as collaborator to gimmiethat.space repo
Find the walkway to the closest Trader Joe's from 401 Shady Ave, Pittsburgh.
Which US states border Pennsylvania?
Tell me the closest restaurant(s) to CMU Posner Hall
Pull up the description page of Piada restaurant near Pitt on Map
What is the duration required to first walk from Massachusetts Institute of Technology to Harvard University, and then drive to Boston Logan International Airport?
Which number to call for the customer service?
Post a notice on a virtual meetup for Harry Poter enthusiasts on July 8th in the books subreddit
Provide me with the complete names of Bluetooth headphones from Sony, and also share the price range for the available models
Where is the nearest In-N-Out to Upitts, and what is the walking distance to it?
Set up a new, empty repository with the name chatgpt_plugin?
Create a best sellers report from 05/01/2022 to 05/31/2023
How much I spend on 4/19/2023 on shopping at One Stop Market?
Change the page title of "404 Not Found" page on my site to "Bruh bro you clicked the wrong page".
Show me the "iphone 12 phone case" listings by name alphabetically.
Create a repo named nolan_academy_awards with movies that won Academy Awards by Christopher Nolan in a README file
Find the page of the place where Mr. Rogers was filmed on the map.
Pull up the description page of Carnegie Music Hall on Map
Draft a new marketing price rule for spring sale that offers a 20 percent discount site-wide for all customers
Open the thread of a trending post on the forum "pittsburgh" and subscribe.
List out reviewers, if exist, who mention about price being unfair
How much refund I should expect from my order canlled in April 2022, including shipping fee
Post "Good idea" for the merge request related to color ulitity in a11yproject.com project
How many commits did kilian make to a11yproject on 3/1/2023?
What is the phone number of Carnegie Mellon Café
Tell me the count of comments that have received more downvotes than upvotes for the user who made the latest post on the space forum.
What are the main criticisms of this product? Please extract the relevant sentences.
Post in history subreddit about what could diffusion model help the correpong field.
Add the following users to repo a11y-webring.club as developer: ['abisubramanya27', 'lahwaacz']
Find the walkway to the closest Japanese food market from 401 Shady Ave, Pittsburgh.
Show me the name of the customers who have expressed dissatisfaction with tanks products?
Create a folder named moive_space in gimmiethat.space repo. Within it, create a file named urls.txt that contains the URLs of the 5 most recent posts from the movies?
Draft a new marketing price rule for Pride Month that offers 45% off on all products for all customers
Tell me the total cost of my latest processing order?
Find the hotel around Carnegie Music Hall
Reduce the price of this product by 10%
Ask for advice about gift for birthday in a subreddit for relations
Draft an email to the shop owner via their contact us function for a coupon as I am a student
Draft a refund message via their "contact us" form for the kitchen organizer I bought around Feb 2023. It broke after three days of use. The shop requires the order id, the reason and the amount to refund in the message. Don't submit yet
Add a simple product named Lelelumon Yoga Mat with 42 in stock, available in size uni-size and color black, priced at $769.99
Add a new size XXXL to green Minerva LumaTech V-Tee
List products from PS4 accessories category by ascending price
Get the product name and discounted price (low to high) of the most recent completed order
Where is the nearest tea cafe to University of Pittsburgh, and what is the walking distance to it?
I have a lot of Nintendo Switch game cards now, help me find the best storage option to fit all 11 cards
Open my latest updated issue that has keyword "homepage content" in its title to check if it is closed
What is the price range of Canon photo printer in the One Stop Market?
Check if the amc theatre in pittsburgh can be reached in one hour by car from hobart street
Tell me when I last ordered my bread olive?
Open a new issue to discuss the implementation of default plugins for .zsh
Find the customer name and email with phone number 2065555555
Post in the most appropriate subreddit and ask for recommendations for must-have product in my life products within a budget of $30
I am at CMU Pittsburgh, how long it takes to drive to the nearest wendys
What are the main criticisms of this product? Please extract the relevant sentences.
List products from kids' bedding category by descending price
Reduce the price of this product by $5
Open my latest updated issue that has keyword "dependency" in its title to check if it is closed
Which US states border Connecticut?
Change the page title of "Home Page" page on my site to "This is the home page!! Leave here!!".
Create a repo named bafta_awards_nolan with movies that are nominated BAFTA Awards by Christopher Nolan in a README file
Show the most recent pending order
What is the price range of teeth grinding mouth guard in the One Stop Market?
Open the thread of a trending post on the forum "space" and subscribe.
Make the LICENSE of byteblaze/cloud-to-butt to MIT license.
Show the most recent cancelled order
I previously ordered some a TV stand sometime around sep 2022 and later cancelled. Can you reorder it for me?
Add HONGJ Hawaiian Beach Outfits Set for Mens, Summer Tropical Tree Printed Relaxed-fit Hawaii Shirts Shorts 2 Piece Suits to my wish list
What is the price range for products from Amazon basic?
Post a notice on a virtual meetup for Tears of Kingdom enthusiasts on Dec 15th in the games subreddit
Today is 6/12/2023. Tell me how many fulfilled orders I have over the past six month, and the total amount of money I spent.
Follow ['Jakub K', 'ghost', 'Benoît Blanchon'] on Gitlab
create a new group "x-lab" with members JonasVautherin, dilipchandima, dawiss1337, bmyun, DCMJY
Show me the command to clone the most stared Covid location tracker with SSH.
Display the list of issues in the kkroening/ffmpeg-python repository that have labels related to questions
List the email address of the top 3 contributors to Pytorch GAN repo, ranked by the number of commits?
Edit my post on Ted Lasso by adding a line to the body that says "Done watching. I love the renew!"
Tell me the total number of cancellations of the customer who has the most cancellations in the history
Given the following locations, ['Massachusetts Institute of Technology', 'Harvard University', 'Boston Logan International Airport'], what would be the optimal route to travel through them all in order to minimize total travel time? Please note the journey begins at the first place listed.
Post my question, "what is the recommended console to buy these days", in a subreddit where I'm likely to get an answer
Find a GitLab repository related to metaseq and make a Reddit post linking to it in a relevant subreddit
Pull up the description page of the Costco in Pittsburhg near a river on Map
What is the minimum travel time by car from CMU gates building to Schenley park?
DisLike all submissions created by jacyanthis in subreddit earthporn
Disable Ryker Tee Crew Neck from the site, they are facing some quality issues.
From my stay at La Quinta Inn near the airport, what's the estimated driving time to reach Upitt?
create a repository named Do it myself that includes a README file with the links to the most active 8 DIY ideas on DIY subreddit?
Add the product with the lowest per unit price from my open tabs to the shopping cart
Tell me the product SKUs in the most recent cancelled orders of the customer who has the most cancellations in the history
Get the total payment amount of the last 5 pending orders
Promote auth0/angular-storage to subreddit technology with the description from the repo itself.
What are the key aspects that the customers don't like about Pursuit Tone Band
yjlou wants to check my dotfile configurations. Please invite him to the repo as a guest.
Create a milestone for the upcoming task of merging all branches to main starting on March 15, 2044 and ending on March 30, 2044
What's the closest national park to Vinalhaven, ME? How long does it take to bike there?
Checkout merge requests assigned to me
Add the product with the lowest per unit price from my open tabs to the shopping cart
Buy the best rating product from "Men's shoe" category with at least 5 reviews and the product is least expensive
Tell me when I last ordered my conditioner?
Measure distance between CVS (closet one) and UPMC Shadyside by walking
Tell me the full address of all US international airports that are within a driving distance of 60 km to Niagara Falls
Show me the command to clone ChatGPT with SSH.
Compare the time for walking and driving route from Carnegie Science Center to Carnegie Mellon University
Buy the highest rated product from the Men clothing category within a budget above 50 but under 129.99.
Change the delivery address for my most recent order to 6726 McPherson Blvd, Pittsburgh, PA.
Change the delivery address for my most recent order to 4000 Forbes Ave, Pittsburgh, PA.
Create a discussion post about "long distance relationship" in a relevant subreddit and ask users for their opinions with the simple prompt, "your opinion"
How many reviews our shop received in Apr 2023?
Open the thread of a trending post on the forum "books" and subscribe.
Reply to the post with my comment "I am a big fan of the bookorg"
Create a discussion post about "Iphone 14" in a relevant subreddit and ask users for their opinions with the simple prompt, "your opinion"
Tell me the total cost of my latest non-cancelled order?
Add the following users to repo millennials-to-snake-people as reporter: ['yjlou', 'a11yproject']
I want to browse the products in the Woman clothing category
Open the thread of a trending post on the forum "machine learning" and subscribe.
How many reviews our shop received from the beginning of the shop?
Fork ChatGPT.
Find the hotel around CMU main campus
What is the size configuration of the picture frame I bought Sep 2022
List the name of the top 3 contributors to facebook's guide on building react apps repo, ranked by the number of commits?
I recently moved, my address is 222 Redwood Rise, Suite 300, Seattle, WA, 98101, update my information on OneStopShopping accordingly
Follow ['ghost', 'R1kk3r', 'Abishek'] on Gitlab
Display the list of issues in the umano/AndroidSlidingUpPanel repository that have labels related to BUG
Post my question, "what is the SOTA web navigation agent repo", in a subreddit where I'm likely to get an answer
Like all submissions created by UniversityofBath in subreddit IAmA
Show me the route and driving time from the city where my E-commerce customer Sophia Young lives to New York City
What is the price range for products from ugreen?
Tell me the reasons why customers like Olivia zip jacket
Modify the address of order #300 to 987 Cedar Court, Los Angeles, CA, 90012
Gather the titles of Nintendo Switch Fortnite Wildcat Console EU reviews with 3 stars and less rating from OneStopShop, and post them in the games subreddit under the title "real user feedback on Nintendo Switch Fortnite Wildcat Console EU"
Add this product to my wishlist
How much I spend in July 2022 on shopping at One Stop Market?
Create a folder named funny_pic in gimmiethat.space repo. Within it, create a file named urls.txt that contains the URLs of the 5 most recent posts from the memes?
Re-post the image of Wife's costume in this page to funny subreddit and note "from /f/pics"
Ask for product recommendations for noise-cancelling headphones within a budget of $200 in r/headphones
Create a milestone for the upcoming event of product launch starting on 1/16/2023 and ending on 1/30/2023
Increase the price of this product by $11.5
Ask for product recommendations for running shoes within a budget of $100 in r/sports
Check if the social security administration in pittsburgh can be reached in one hour by car from Carnegie Mellon University
What's the total number of items sold in the most recent 7 orders?
set the homepage URL on my GitLab profile to byteblaze.github.io
Find the page of the university that has most Turning Award winners on the map.
Tell me the email address of the contributor who has the most commits to branch gh-page
Star the top five most stared repos in Gitlab
Start a private project agi_index with HTML template and add Vinta Chen as members
Find the customer name and email with phone number 2137418080
Display the list of issues in the a11yproject/a11yproject.com repository that have labels related to help needed
Presents the monthly count of successful orders from Jan to December 2022 in MM:COUNT format
Modify the address of order #301 to 321 Birch Boulevard, Suite 200, Dallas, TX, 75201
Tell me the full name, gitlab account name, location and email address of the contributor who has the most commits to branch php52
Tell me the number of commits of the contributor who has the most commits to branch main
Show me the command to clone the best GAN python implementation with SSH.
Tell me the full address of all international airports that are within a driving distance of 30 km to Carnegie Art Museum
Go to the merge request on verification functions I have to review, find if the author of the merge request responded at the end, and reply "Thank you" if he did. Otherwise remind him with a simple @.
Show all customers
Check out the most recent open issues
Show me the command to clone Super_Awesome_Robot with SSH.
Fill the "contact us" form in the site for a refund on the remote controller I bought, stating that it broke after just three days of use. Also, ensure to include the order number #180 and the product SKU. Don't submit yet, I will check.
How much I spend each month from Jan to the end of March 2023 on shopping at One Stop Market?
Update order #301 with the DHL tracking number 239028439840
Set my gitlab status as Busy.
What is the zip code of Columbia University?
Ask for advice about break-up remedy in a subreddit for relations
What is the zip code of Chatham University?
Today is 6/12/2023. Tell me how many fulfilled orders I have over the past three days, and the total amount of money I spent.
Update the description of Lucia Cross-Fit Bra to highlight the real user positive reviews by quoting the comments
Disable lHelios Endurance Tank from the site, they are facing some quality issues.
Reduce the price of this product by 15%
Submit a merge request for the branch that implements the support of template strings to be merged into master branch, assign myself and Roshan as the reviewer
Rate my recent purchase of floor lamp with 5 stars, using my nickname Emma Lopez?
What are the top-2 best-selling product in 2022
Find the walkway to the closest grocessory owned by Amazon from 401 Shady Ave, Pittsburgh.
Find the walkway to the closest Target from 401 Shady Ave, Pittsburgh.
Where is the nearest Five Guys to 5700 Penn Ave, and what is the walking distance to it?
What is the estimated driving time between the city of Niagara Falls and the city of Yale University?
Tell me when I last ordered my body butter?
Show the least expensive shoe storage with a minimum storage capacity of 12 pairs.
Promote byteblaze/cloud-to-butt to subreddit LifeProTips with the description from the repo itself.
What is the minimum travel time by car from Animal Rescue League of Pittsburgh to Schenley park?
Tell me the status of my latest order and when will it arrive
Update the project site's title to "Hello"
Draft a refund message via their "contact us" form for the phone screen protector I bought March 2023. It broke after three days of use. The shop requires the order id, the reason and the amount to refund in the message. Don't submit yet
create a new group "crew" with members ASWATFZLLC, patrickhlauke, westurner, linkmatrix
Post my question, "places for new drivers to learn driving in pittsburgh", in a subreddit where I'm likely to get an answer
Create a repo named nolan_old_fans with movies directed by Christopher Nolan before 2010 in a README file
I previously ordered some a table lamp in May 2023 and later cancelled. Can you reorder it for me?
Please provide me with the complete product names of Oral B brush heads designed for children, along with their corresponding price range per brush
I am doing a market survey for one stop market, show me the most expensive product from competative swimwear category
Assign the issue regarding flash alert in primer design guide repo to myself.
Today is 6/12/2023. Tell me how many fulfilled orders I have over the past four month, and the total amount of money I spent.
I am at CMU Pittsburgh, how long it takes to drive to the nearest cold stone ice cream
Increase the price of this product by 10%
Update order #307 with the DHL tracking number 24353446464
Show me the walking distance from nearby hotels to CMU, Pittsburgh that take at most 5 minutes?
What is the top-1 best-selling product type in Jan 2023
I am doing a market survey for one stop market, show me the most expensive product from PS4 accessories category
Tell me the coordinates of Carnegie Mellon Café in DD format
Create an issue in cloud-to-butt repo with title "Let's keep the project alive". Assign the issue to myself. Set due date to be the end of Q1 2033
I recently moved, my address is 111 Magnolia Path, Atlanta, GA, 30303, update my information on OneStopShopping accordingly
Promote koush/AndroidAsync to subreddit funny with the description from the repo itself.
Draft a refund message via their "contact us" form for the PS3 remote controller I bought early 2023. It broke after three days of use. The shop requires the order id, the reason and the amount to refund in the message. Don't submit yet
Presents the monthly count of successful orders 01/2023-05/2023 in MM:COUNT format
How many commits did Eric make between Feb 2023 and May 2023?
Show me the "iphone 12 phone case" listings by price.
Show the least expensive switch card holder with a minimum storage capacity of 15 cards.
Change my reddit bio to "Freelance Web Developer"
What's the closest national park to the hometown of Stephen King? How long it takes to drive there?
Show me the way from Carnegie Mellon University to the home stadium of Boston home NBA team
Create a new forum named PlantsForCatParents, with a description of Cat parents & plan lovers, and include ['Cat friendly', 'Local vendors', 'Promotion', 'Toxic plants!'] in the sidebar?
Add a simple product named Energy-Bulk Women Shirt with 50 in stock, available in size S and color blue, priced at $60
Set up a new, empty repository with the name awesome_llm_reading?
What is the price range for products from Perricone MD?
List products from nutrition bars and drinks category by ascending price
Change my reddit bio to "Awesome Prompt Artist"
Change the page title of "Privacy Policy" page on my site to "No privacy policy is needed is this dystopian world".
Edit my post on The Night Agent by adding a line to the body that says "Done watching, pretty cool!"
Tell me the full names of the repositories where I made contributions and they got the least stars?
Get the purchase date and order id of the most recent pending order
Show me the customers who have expressed dissatisfaction with Olivia zip jacket?
Edit my post on Nvidia RTX 4090 by adding a line to the body that says "EDIT: This news aged well"
Thumbs down the top 1 post ever in gadgets.
Assign the issue regarding 404 in a11yproject to Roshanjossey.
Add DkRgVNY Lace Spcling Lingerie Womens Sexy Hollow Out Underwear Bodysuit One Piece Snap Crotch Clubwear Teddy Bodysuit to my wish list
Rate my recent purchase of Jiffy Corn Muffin Cornbread Mix with 4 stars, using my nickname ShoppingEmma?
Tell me the number of followers of the contributor who has the most commits to branch main
Today is 6/12/2023. Tell me how many fulfilled orders I have over the past year, and the total amount of money I spent.
Gather the titles of HORI 3D Surround Gaming Neckset reviews with 2 stars and less rating from OneStopShop, and post them in the games subreddit under the title "real user feedback on HORI 3D Surround Gaming Neckset"
Submit a merge request for build time debug branch to be merged into main branch, assign myself as the reviewer
Where is the nearest pharmacy from Carnegie Mellon I can walk within 20mins
Invite Abishek and Vinta as collaborator to a11yproject.com repo
Search for "xbox"
What are the key aspects that the customers don't like about Circe ice fleece
DisLike all submissions created by sirbarani in subreddit sports
Star the top one most stared repos in Gitlab
Tell me the the number of reviews that our store received by far that mention term "not useful"
create a new group "coding_friends" with members qhduan, Agnes-U
List all opened issues that report bugs
Open an issue to report the issue of connection refused in ChatGPT.
What is the top-1 best-selling product type in Quarter 1 2022
What are the main criticisms of this product? Please extract the relevant sentences.
Tell me the count of comments that have received more downvotes than upvotes for the user who made the latest post on the Showerthoughts forum.
List out reviewers, if exist, who mention about average print quality
Show me the "mouth night guard" listings by descending price.
List the top 3 search terms in my store
Check if the police station in pittsburgh can be reached in one hour by car from gates building at CMU
Submit a request to merge dialog-component branch into dialog branch, assign Carol as the reviewer
Upvote the newest post in books subreddit
Which customer(s) has completed the second most number of orders in the entire history?
Upvote the newest post in DIY subreddit
I will arrive Pittsburgh Airport soon. Provide the name of a Hyatt hotel in the vicinity, if available. Then, tell me the the shortest walking time to a supermarket from the hotel.
List the name of the top 3 contributors to prime/design repo, ranked by the number of commits?
Add the following users to my time tracking tool as guest: ['yjlou']
How much I spent on food shopping during from mid Jan to the end Jan 2023
Pull up the description page of Carnegie Mellon University on Map
Change the page title of "Enable Cookies" page on my site to "Cookie monster coming to your place".
Today is 3/15/2023, generate a sales order report for last month
Add Tide PODS Spring Meadow Scent HE Turbo Laundry Detergent Pacs, 81 Count to my wish list
Rate my recent purchase of PS3 Remote Controllers with 3 stars, using my nickname GamingEmma?
How many commits did Eric make to a11yproject on 3/2?
Give me the product names and the sizes of the products that have 2-3 units left
Show the most recent completed order
Show the most recent processing order
Open my latest updated issue that has keyword "feature" in its title to check if it is closed
Tell me the total cost of my latest cancelled order?
Reduce the price of yellow shirts from Gwyn Endurance in all size below L by 15%
Like all submissions created by Don_Gato1 in subreddit new york
Reply to the manager of the website in this post with "thanks! I am a big fan of your website."
Which customer has completed the most number of orders in the entire history?
What is the date when I made my first purchase on this site?
Tell me who has made the most contributions, in terms of number of commits, to the AndroidSlidingUpPanel project
Search for "usb wifi"
How much I spent on food-related shopping during March 2023
What are the top-3 best-selling product in Jan 2023
Create an issue in empathy-prompts repo with title "Integrating LLMs for better prompts". Assign the issue to Roshanjossey. Set due date to be the beginning of Q2 2033
Create a repo named nolan_followers with career timeline of Christopher Nolan in a README file
How much I spent on hair care and hair style shopping during Jan 2023
Submit a merge request for dialog-component branch to be merged into bump-doctocat branch, assign primer as the reviewer
Assign the issue regarding linking to an accessibility statement in a11y-webring.club to Rohan.
Tell me when I last ordered my toothpaste?
Create a milestone for the upcoming task of adding a new branch for zsh comprehensive support starting on 5/1/2044 and ending on in 20 days