CIS 548讲解 辅导C++程序语言 Turtles in Time

- 首页 >> C/C++编程

CIS 548 - Project 0 - Spring 2019

CIS 548 Project 0

penn-shredder: Turtles in Time

“Tonight I dine on turtle soup!”

shredder, Teenage Mutant Ninja Turtles the Animated Series

CIS548 Staff

DUE: Friday, Jan. 25th @ 10pm

Directions

This is an individual assignment. You may not work with others. Please regularly check Piazza throughout

this course for coding style, and project specification clarifications. You are encouraged to version control

your code, but do not work in public GitHub repositories. Please avoid publishing this project at

anytime, even post-submission, to observe course integrity policies.

Overview

In this assignment you will implement a basic shell named penn-shredder that will restrict the run time

of executed processes. Your shell will read input from the users and execute it as a new process, but if

the process exceeds a timeout, it will be killed. You will complete this assignment using only a specified

set of system calls and without the use of standard C library functions (e.g., printf(3), fgets(3),

system(3), etc.). You may freely use any functions in string.h. Please be aware of the accuracy of

your output contents, especially for spacing as this year’s autograder will take into consideration whitespace

for correctness.

1 Specification

At its core, a shell is a simple loop. Upon each iteration, the shell: prompts the user for a program to run;

executes the program as a separate process; waits for the process to finish; and then, re-prompts the user

completing the loop. The shell exits the loop when the user provides EOF (end of file) (i.e., Ctrl-D). Your

shell, penn-shredder, will do all of that, but with a slight twist.

penn-shredder takes a command line option for specifying a program timeout. If any process runs for

longer than the specified timeout, penn-shredder will kill the program and report to the user his snide catchphrase,

“Bwahaha ... tonight I dine on turtle soup”. The exact whitespace usage in the catch-phrase is

necessary for the autograder to correctly grade your assignment.

The following is an example of input that should invoke the catch-phrase:

Compiled 2019-01-16 14:08

CIS 548 - Project 0 - Spring 2019

bash# ./penn-shredder 10

penn-shredder# /bin/cat

Bwahaha ... tonight I dine on turtle soup

penn-shredder# /bin/pwd /home/yourusername

penn-shredder#

Here, penn-shredder was executed with a timeout argument of 10 seconds, and any program that runs

longer than 10 seconds will be killed (or shredded). The cat program executed without arguments runs

indefinitely, and thus was killed after exceeding the timeout. Conversely, pwd returns quickly and was not

killed, upsetting shredder and his henchman.

1.1 read, fork, exec, wait, and repeat!

As described previously, a shell is just a loop performing the same procedure over and over again. Essentially,

that procedure can be completed with these four system calls:

read(2) : read input from stdin into a buffer

fork(2) : create a new process that is an exact copy of the current running program.

execve(2) : replace the current running program with another

wait(2) : wait for a child process to finish before proceeding

Your program, in pseudo-code, should look roughly like this:

while(1){

read(cmd, ...);

pid = fork();

if(!pid){

execve(cmd,...);

}else{

wait();

}

}

This may seem simple, but there are many, many things that can go wrong. You should spend some time

carefully reading the entire man page for all four of these systems calls. 1 To do so, in a terminal type:

bash# man 2 read

where 2 specifies the manual section. If you do not specify the manual section, you may get information for

a different read command.

1Hint: You may want to pay attention to wait(2). Be sure you understand how the waitpid (or wait) is called and

checked in the example. Every year many students are surprised by its usage after grading.

CIS 548 - Project 0 - Spring 2019

1.2 Timing is Everything

To time a running program your shell will employ the alarm(2) system call. The alarm(2) system call

simply tells the operating system to deliver a SIGALRM signal after a specified time. The SIGALRM signal

must be handled by your shell; otherwise, your shell will exit.

To handle a signal a signal handing function must be registered with the operating system via the

signal(2) system call. When the signal is delivered, the operating system will preempt your shells

current operations (e.g., waiting for the program to finish) and this program on your own to best understand

signal handling. Here are some questions you could try and answer: What happens if you remove the call to

signal(2)? What happens if you provide different arguments to alarm(2)? What happens if you use

the sleep(3) function instead of the busy wait?2 You should also spend time carefully reading the entire

man page for these systems calls and references in APUE regarding signals and signal handling.

1.3 Hit the kill-switch!

The kill(2) system call delivers a signal to a process. Despite its morbid name, it will only kill (or

terminate) a program if the right signal is delivered. One such signal will always do just that, SIGKILL.

The SIGKILL signal has the special property that it cannot be handled or ignored, so no matter the program

your shell executed it must heed the signal.3

1.4 Prompting and I/O

In programming your shell, you will only use system calls and nothing from the C standard library (except

string.h). This includes input and output functions like printf(3) and fgets(3). Instead you will

use the read(2) and write(2) system calls. Consult the manual pages for these functions’ specification.

Your shell must prompt the user for input as follows:

penn-shredder#

Please note that the prompt has a whitespace after the octothorpe, so if a user begins typing, they would see:

penn-shredder# somestringhere

Following the prompt, your program will read input from the user. You may truncate user input to 1024

bytes, but your shell must gracefully handle input longer than that. By graceful, we mean providing some

kind of reasonable behavior such as implicitly truncating to 1024 bytes and properly flushing extra bytes.

You may assume that the 1024th byte is the null-termination byte.

1.5 Argument Restrictions

Your shell is not required to parse additional program arguments. That is, a user can provide any number of

arguments, but your shell may ignore them, simply executing the base program. However, your shell must

still execute the program even if arguments are provided. For example:

2

1pt Extra Credit: What does the man page have to say about mixing calls to sleep(3) and alarm(2)? Answer this

question in your README: it should include a sample program.

3

1pt Extra Credit: What other signal(s) have this property? Which signals will terminate the program if unhandled? Which

man page has this information? Answer these questions in your README

CIS 548 - Project 0 - Spring 2019

penn-shredder# /bin/sleep 100

sleep: missing operand

Try ‘sleep --help’ for more information.

Due to this restriction, you may find it useful to write separate testing programs that exit after a specified

time (perhaps based on the SIGALRM example above). Feel free to submit these test programs with your

shell.4

1.6 Executing a Program

Your shell may only use the execve(2) system call to execute a program. This system call instructs the

operating system to replace the current running program – that would be the child of your shell – with the

specified program. Please refer to the manual for more details.

The execve(2) system call is the base of a larger collection of functions; however, you may not use

those other functions. Explicitly, you may not use execl(3), execv(3)), or any other function listed in

the exec(3) manual page. As a result, the user of your shell must specify the entire path to a program to

execute it. To learn where a program lives (i.e., its path), use the which program in your non-penn-shredder

shell.

Forking, like all other system calls, is computationally expensive. You should not call a system call if

it’s simple to determine from internal state that the system call would do nothing or fail, and you should

monitor the status of any forked child process.

1.7 Ctrl-C behavior

Ctrl-C (SIGINT) is a very helpful signal often used from the shell to stop the current running program.

Child processes started from penn-shredder should respond to Ctrl-C by following their normal behavior

on SIGINT, but penn-shredder itself should not exit (even when Ctrl-C is invoked without a child process

running).

bash# ./penn-shredder

penn-shredder# /bin/cat

Cpenn-shredder# ?C/bin/pwd

<Your working directory>

penn-shredder#

1.8 Arguments to penn-shredder

Although within penn-shredder there is no requirement to handle arguments, the penn-shredder program

itself must take an optional argument: the execution timeout. If no argument is provided, then penn-shredder

imposes no time restriction on executing processes. For example, an optional argument of 10 results in a

timeout of 10 seconds:

bash# ./penn-shredder 10

4

5pt Extra Credit: For extra credit, write a simple parser that will handle arbitrary arguments and pass them to execve

appropriately. Because it’d make it far too easy, for this extra credit opportunity you cannot use any function from the C standard

library. This includes everything from string.h, so don’t even think about using strdup(3) or strtok(3) or their variants.

CIS 548 - Project 0 - Spring 2019

Omitting the optional argument results in no timeouts.

As mentioned before, with the exception of string.h you cannot use anything from the C standard

library. However, for command line argument int parsing you may make a single call to atoi(3) or

strtol(3) to convert command line input into an integer. You should check for errors in this conversion,

e.g., when a user provides a character instead of a number.5

2 Error Handling

All system call functions you use will report errors via the return value. As a general rule, if the return value

is less than 0, an error occurred and errno is set appropriately. You must check your error conditions

and handle or report errors. To expedite the error checking process, we allow you to use perror(3)

library function. Although you are allowed to use perror, it does not imply you should report all errors at

an extreme verbosity. Instead, try and strike a balance between sparse and verbose reporting.

3 Code Organization

Sane code organization is critical for all software: Your code should not be all in one giant penn-shredder.c

file. Rather, you should create reasonable modules and expose the relevant interfaces and constants in .h

files. Make sure that your Makefile builds all relevant modules and links them into the final penn-shredder

executable.

You code should adhere to DRY (Don’t Repeat Yourself). If you are writing code that is used in more

than one place you should write a function or a macro. See the Piazza Post for general coding conventions

and tips.

4 Memory Errors

You are required to check your code for memory errors. This is a nontrivial task, but an extremely important

one. Code with memory leaks and memory violations will be deducted. Fortunately, there is a very nice

tool valgrind that is available to help you. It should be installed on the SpecLab cluster. valgrind

is a tool and not a solution. You must still find and fix any bugs that valgrind locates, but there is no

guarantee it will find all memory errors in your code, especially those that rely on user input!

5 Acceptable Library Functions

In this assignment you may use only the following system calls:

execve(2)

fork(2)

wait(2)

read(2)

5

3pt Extra Credit: For extra credit, implement your own version of atoi(3) that can handle arbitrary numbers within the 32

bit width of an integer.

CIS 548 - Project 0 - Spring 2019

write(2)

signal(2)

alarm(2)

kill(2)

exit(2)

And you may use these non-system calls:

malloc(3) or calloc(3)

free(3)

perror(3) for reporting errors

atoi(3) or strtol(3) but just once!

string(3) for string utility functions

Using any other library function than those specified above will affect your grade on this assignment. If you

use the system(3) library function, you will receive a ZERO on this assignment.

6 Developing Your Code

In general, there are two environments for you to develop projects in this course: (1)virtual machine

powered by Vagrant or (2) SpecLab servers for remote developing. We recommend the first environment.

For the first choice, we provide a Vagrantfile for your convenience to create a course-standardized virtual

machine. You can find the set-up instruction and video on Piazza. We highly recommend you to use it as

your developing environment because all grading will be done on it. Please do not develop on macOS as

there are significant differences between macOS and Unix variants such as Ubuntu Linux. If you decide

to develop on a different machine anyway, you must compile and test your penn-shredder on the coursestandardized

virtual machine to ensure your penn-shredder runs as you expect during grading.

The course-standardized virtual machine is powered by Vagrant. Vagrant has a synced folder feature,

which can automatically sync your files to and from the guest machine. It provides the flexibility so that

you can develop your code on the host machine with your familiar IDEs or tools instead of editing in

the VM through a terminal-based editor or over SSH. Read the following link to learn more, https:

//www.vagrantup.com/intro/getting-started/synced_folders.html.

However, there is a case-sensitive issue in the synced folder you need to pay attention to. It is known

that Windows and macOS are case insensitive while Linux is case sensitive. The course-standardized virtual

machine is basically a Ubuntu system. In this virtual machine, the file system is case-sensitive except that

the synced folder (/vagrant) has the same case-sensitive with the host machine. To be more specific, if

your host machine is any Linux distribution, then your virtual machine has no case sensitive issue. However,

if your host machine is Windows or macOS, your synced folder /vagrant is case-insensitive and the other

directories are case-sensitive. To handle case-sensitive issue and avoid annoying compiling bugs when you

collaborate with others, we highly recommend to name all your .h and .c files in camelCase. A good

CIS 548 - Project 0 - Spring 2019

way to check this is to copy your project to folders other than the shared folder in the vagrant virtual machine,

and run the code there.

For the second choice, SpecLab will be available to you as CIS-548 students, however, we discourage

its usage unless your personal machine has super, super weak performance specifications such as a single

core machine with 2GB of RAM or less. SpecLab is a collection of older desktops that run the same Linux

variant as eniac, and most importantly, you can crash them as much as you want without causing too much

chaos. You access SpecLab remotely over ssh. There are roughly 10 SpecLab machines up at any time.

When you are ready to develop, choose a random number between 01 and 50, or so, and then issue this

command:

bash# ssh specDD.seas.upenn.edu

where DD is replaced with the number you chose. If that machine is not responding, then add one to that

number and try again. Not all SpecLab machines are currently up, but most are. You must be on SEASnet

to directly ssh into SpecLab. The RESnet (your dorms) is not on SEASnet, but the machines in the Moore

computer lab are. If you are not on SEASnet, you may still remotely access SpecLab by first ssh-ing into

eniac and then ssh-ing into SpecLab.

Do not develop your code on eniac. CIS 548 students are notorious for crashing eniac in creative ways,

normally via a “fork-bomb.” This always seems to happen about two hours before a homework deadline,

and the result is chaos, not just for our class, but for everyone else using eniac.

Students who are caught running penn-shredder directly on eniac will be penalized.

7 What to turn in

You must provide each of the following for your submission, no matter what. Failure to do so may reflect

in your grade.

1. README file. In the README you will provide

Your name and eniac username

A list of submitted source files

Extra credit answers

Compilation Instructions

Overview of work accomplished

Description of code and code layout

General comments and anything that can help us grade your code

2. Makefile. We should be able to compile your code by simply typing make. Your executable should

be named penn-shredder. If you do not know what a Makefile is or how to write one ask one

of the TAs for help or consult one the many on-line tutorials.

3. Your code. You may think this is a joke, but at least once a year, someone forgets to include it.

CIS 548 - Project 0 - Spring 2019

8 Submission

Submission should be done through Canvas. Please compress your files via the following:

tar -cvzf yourpennkey-project0.tar.gz inputfile1 inputfile2...

As explained above, please remember to include your README, your Makefile, any header files you

have written, along with your main penn-shredder C program. Canvas will accept multiple submissions but

we will grade only the most recent submission submitted before the submission deadline.

9 Grading Guidelines

10% Documentation

10% Proper use of system calls

10% Peer review about general code design

70% Functionality

Please note that general deductions may occur for a variety of programming errors including memory

violations, lack of error checking, poor code organization, etc. Also, do not take the documentation lightly,

it provides us (the graders) a road-map of your code, and without it, it is quite difficult to grade the implementation.

You may use any comment style you wish, but we highly suggest using DOxygen style comments as

they promote fully documenting function inputs, outputs, and error cases. A useful guide to this comment

style can be found here: https://www.stack.nl/ dimitri/doxygen/manual/docblocks.html

This year, we are adding a peer review component as a part of your evaluation. After the submission,

you will evaluate another student’s code based on a given rubric using a profiling tool such as valgrind

and sanitizers. Keep a lookout on Piazza for more details about this after your submission.

Your programs will be graded on the course-standardized virtual machines, and must execute as

specified there. Although you may develop and test on your local machine, you should always test that your

program functions properly there.

An autograder will be run against your submission so whitespace matters. Specifically, be sure to

match the ”Bwahaha ... tonight I dine on turtle soup” prompt and include the single whitespace after the

octothorpe.

10 Attribution

This is a large and complex assignment, using arcane and compactly documented APIs. We do not expect

you to be able to complete it without relying on some outside references. That said, we do expect you to

struggle a lot, which is how you will learn about systems programming, by doing things yourself.

The primary rule to keep you safe from plagiarism/cheating in this project is to attribute in your documentation

any outside sources you use. This includes both resources used to help you understand the

concepts, and resources containing sample code you incorporated in your shell. The course text and APUE

need only be cited in the latter case. You should also use external code sparingly. Using most of an example

CIS 548 - Project 0 - Spring 2019

from the pipe(2) man page is ok; using the ParseInputAndCreateJobAndHandleSignals()

function from Foo’s Handy Write Your Own Shell Tutorial is not (both verbatim, and as a closely followed

template on structuring your shell).


站长地图