S081 Labs
Utilities
Install xv6 on Catalina
配置 S081
实验环境的时候,官网对于 macOS 的安装方法 brew install riscv-tools
等包管理无法在 Catalina 及以下的系统版本上安装成功,需要进行手动编译。编译项如下
- RISC-V工具链: 包括一系列交叉编译的工具,用于把源码编译成机器码,如
gcc,binutils,glibc
等 - QEMU模拟器: 用于在 X86 机器上模拟 RISC-V 架构的 CPU
- xv6源码: xv6 操作系统源码
一、RISC-V toolchain
下载后在源码根目录进行编译,具体参考官方文档。编译大约需要30min:
1 | git clone --recursive https://github.com/riscv/riscv-gnu-toolchain |
在macOS catalina版本下进行./configure时提示缺少GNU的 awk 和 sed: configure: error: GNU awk not found,手动安装即可:
1 | brew install gawk |
安装完成后需配置环境变量,与上一步设置的安装路径一致。Mac下的环境配置文件是/.bash_profile(Linux下为/.bashrc或~/.profile),操作如下:
1 | vim ~/.bash_profile #打开配置文件 |
此时在命令行输入 riscv64-unknown-elf-gcc -v
,如果能显示版本信息则代表安装成功。
需要注意的是,在编译的过程中,会出现报错,需要手动在 terminal.c
和 rltty.c
中添加 #include <sys/ioctl.h>
头文件。
二、QEMU
编译生成的 risc-v 机器码需要通过模拟 cpu 执行,因此需要从qemu官网下载指定版本的源码并编译:
1 | wget https://download.qemu.org/qemu-4.1.0.tar.xz #下载后解压并进入目录 |
3 xv6
进入 MIT 的官网 https://pdos.csail.mit.edu/6.S081/2020/
, 键入以下命令
1 | git clone git://g.csail.mit.edu/xv6-labs-2020 |
即进入实验环境
Boot xv6 [Easy]
The xv6-labs-2020 repository differs slightly from the book’s xv6-riscv; it mostly adds some files. If you are curious look at the git log:
1 | git log |
The files you will need for this and subsequent lab assignments are distributed using the Git version control system. Above you switched to a branch (git checkout util) containing a version of xv6 tailored to this lab. To learn more about Git, take a look at the Git user’s manual, or, you may find this CS-oriented overview of Git useful. Git allows you to keep track of the changes you make to the code. For example, if you are finished with one of the exercises, and want to checkpoint your progress, you can commit your changes by running:
1 | git commit -am 'my solution for util lab exercise 1' |
You can keep track of your changes by using the git diff command. Running git diff will display the changes to your code since your last commit, and git diff origin/util will display the changes relative to the initial xv6-labs-2020 code. Here, origin/xv6-labs-2020
is the name of the git branch with the initial code you downloaded for the class.
These are the files that mkfs
includes in the initial file system; most are programs you can run. You just ran one of them: ls
. If you type ls
at the prompt, you should see output similar to the following:
xv6 has no ps
command, but, if you type Ctrl-p, the kernel will print information about each process. If you try it now, you’ll see two lines: one for init
, and one for sh
.
To quit qemu type: Ctrl-a x
.
Grading and hand-in procedure
You can run make grade
to test your solutions with the grading program. The TAs will use the same grading program to assign your lab submission a grade. Separately, we will also have check-off meetings for labs (see Grading policy).
The lab code comes with GNU Make rules to make submission easier. After committing your final changes to the lab, type make handin to submit your lab. For detailed instructions on how to submit see below.
sleep (easy)
Implement the UNIX program
sleep
for xv6; yoursleep
should pause for a user-specified number of ticks. A tick is a notion of time defined by the xv6 kernel, namely the time between two interrupts from the timer chip. Your solution should be in the fileuser/sleep.c
.
Preparation
- read Chapter 1 of the xv6 book.
- Look at some of the other programs in
user/
(e.g.,user/echo.c
,user/grep.c
, anduser/rm.c
) to see how you can obtain the command-line arguments passed to a program. - If the user forgets to pass an argument, sleep should print an error message.
- The command-line argument is passed as a string; you can convert it to an integer using
atoi
(see user/ulib.c). - Use the system call
sleep
. - See
kernel/sysproc.c
for the xv6 kernel code that implements thesleep
system call (look forsys_sleep
),user/user.h
for the C definition ofsleep
callable from a user program, anduser/usys.S
for the assembler code that jumps from user code into the kernel forsleep
. - Make sure
main
callsexit()
in order to exit your program. - Add your
sleep
program toUPROGS
in Makefile; once you’ve done that,make qemu
will compile your program and you’ll be able to run it from the xv6 shell. - Look at Kernighan and Ritchie’s book The C programming language (second edition) (K&R) to learn about C.
Presentation
Run the program from the xv6 shell:
1 | $ sleep 10 |
Your solution is correct if your program pauses when run as shown above. Run make grade to see if you indeed pass the sleep tests.
Note that make grade runs all tests, including the ones for the assignments below. If you want to run the grade tests for one assignment, type:
1 | ./grade-lab-util sleep |
This will run the grade tests that match “sleep”. Or, you can type:
1 | make GRADEFLAGS=sleep grade |
pingpong (easy)
Write a program that uses UNIX system calls to ‘’ping-pong’’ a byte between two processes over a pair of pipes, one for each direction. The parent should send a byte to the child; the child should print “
: received ping”, where is its process ID, write the byte on the pipe to the parent, and exit; the parent should read the byte from the child, print “ : received pong”, and exit. Your solution should be in the file user/pingpong.c
.
Preparation
- Use
pipe
to create a pipe. - Use
fork
to create a child. - Use
read
to read from the pipe, andwrite
to write to the pipe. - Use
getpid
to find the process ID of the calling process. - Add the program to
UPROGS
in Makefile. - User programs on xv6 have a limited set of library functions available to them. You can see the list in
user/user.h
; the source (other than for system calls) is inuser/ulib.c
,user/printf.c
, anduser/umalloc.c
.
Presentation
Run the program from the xv6 shell and it should produce the following output:
1 | pingpong |
Your solution is correct if your program exchanges a byte between two processes and produces output as shown above.
primes (moderate)/(hard)
Write a concurrent version of prime sieve using pipes. This idea is due to Doug McIlroy, inventor of Unix pipes. The picture halfway down this page and the surrounding text explain how to do it. Your solution should be in the file
user/primes.c
.Your goal is to use
pipe
andfork
to set up the pipeline. The first process feeds the numbers 2 through 35 into the pipeline. For each prime number, you will arrange to create one process that reads from its left neighbor over a pipe and writes to its right neighbor over another pipe. Since xv6 has limited number of file descriptors and processes, the first process can stop at 35.
Preparation
- Be careful to close file descriptors that a process doesn’t need, because otherwise your program will run xv6 out of resources before the first process reaches 35.
- Once the first process reaches 35, it should wait until the entire pipeline terminates, including all children, grandchildren, &c. Thus the main primes process should only exit after all the output has been printed, and after all the other primes processes have exited.
- Hint:
read
returns zero when the write-side of a pipe is closed. - It’s simplest to directly write 32-bit (4-byte)
int
s to the pipes, rather than using formatted ASCII I/O. - You should create the processes in the pipeline only as they are needed.
- Add the program to
UPROGS
in Makefile.
Presentation
Your solution is correct if it implements a pipe-based sieve and produces the following output:
1 | primes |
find (moderate)
Write a simple version of the UNIX find program: find all the files in a directory tree with a specific name. Your solution should be in the file
user/find.c
.
Preparation
- Look at user/ls.c to see how to read directories.
- Use recursion to allow find to descend into sub-directories.
- Don’t recurse into “.” and “..”.
- Changes to the file system persist across runs of qemu; to get a clean file system run make clean and then make qemu.
- You’ll need to use C strings. Have a look at K&R (the C book), for example Section 5.5.
- Note that == does not compare strings like in Python. Use strcmp() instead.
- Add the program to
UPROGS
in Makefile.
Presentation
Your solution is correct if produces the following output (when the file system contains the files b
and a/b
):
1 | echo > b |
xargs (moderate)
Write a simple version of the UNIX xargs program: read lines from the standard input and run a command for each line, supplying the line as arguments to the command. Your solution should be in the file
user/xargs.c
.
Preparation
- Use
fork
andexec
to invoke the command on each line of input. Usewait
in the parent to wait for the child to complete the command. - To read individual lines of input, read a character at a time until a newline (‘\n’) appears.
- kernel/param.h declares MAXARG, which may be useful if you need to declare an argv array.
- Add the program to
UPROGS
in Makefile. - Changes to the file system persist across runs of qemu; to get a clean file system run make clean and then make qemu.
xargs, find, and grep combine well:
1 | find . b | xargs grep hello |
will run “grep hello” on each file named b in the directories below “.”.
To test your solution for xargs, run the shell script xargstest.sh. Your solution is correct if it produces the following output:
1 | sh < xargstest.sh |
You may have to go back and fix bugs in your find program. The output has many $
because the xv6 shell doesn’t realize it is processing commands from a file instead of from the console, and prints a $
for each command in the file.
Presentation
The following example illustrates xarg’s behavior:
1 | echo hello too | xargs echo bye |
Note that the command here is “echo bye” and the additional arguments are “hello too”, making the command “echo bye hello too”, which outputs “bye hello too”.
Please note that xargs on UNIX makes an optimization where it will feed more than argument to the command at a time. We don’t expect you to make this optimization. To make xargs on UNIX behave the way we want it to for this lab, please run it with the -n option set to 1. For instance
1 | echo "1\n2" | xargs -n 1 echo line |
Optional challenge exercises
- Write an uptime program that prints the uptime in terms of ticks using the
uptime
system call. (easy) - Support regular expressions in name matching for
find
.grep.c
has some primitive support for regular expressions. (easy) - The xv6 shell (
user/sh.c
) is just another user program and you can improve it. It is a minimal shell and lacks many features found in real shell. For example, modify the shell to not print a$
when processing shell commands from a file (moderate), modify the shell to support wait (easy), modify the shell to support lists of commands, separated by “;” (moderate), modify the shell to support sub-shells by implementing “(“ and “)” (moderate), modify the shell to support tab completion (easy), modify the shell to keep a history of passed shell commands (moderate), or anything else you would like your shell to do. (If you are very ambitious, you may have to modify the kernel to support the kernel features you need; xv6 doesn’t support much.)